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Abstract. We prove a T(l) theorem for bilinear singular integral operators (trilinear 
forms) with a one-dimensional modulation symmetry. 



1. Introduction 

The T(l) Theorem is a criterion that gives necessary and sufficient conditions for the 
boundedness of non-convolution singular integral operators. It arose as a culmina- 
tion of decade long efforts to understand the Cauchy integral operator on a Lipschitz 
graph and the related Calderon commutators. In the original statement of the theo- 
rem, proved by G. David and J.L. Journe (|l9])) the necessary and sufficient conditions 
are expressed by the requirement that some properly defined functions T(l) and T*(l) 
belong to BMO - hence the name of the theorem - together with the so called weak 
boundedness property. This latter condition requires the bounds when tested weakly 
on a restricted class of bump functions 

l(^(<^x,i?),<^j/,i?)| < 

where (px,R{t) = (p{R~^{t - x)). 

In ([2Z]), E. Stein reformulated the necessary and sufficient conditions into what 
he called the restricted boundedness property. This amounts to the existence of 
bounds when strongly tested on the same class of bump functions, that is 

\\Ti^.,R)h < R'/' 

and likewise for T*. Both forms of the T(l) theorem will be used in this paper and 
further developed in a bilinear version to study certain modulation invariant bilinear 
singular integrals. 

A basic operator in the study of the Cauchy integral on a Lipschitz graph is Calderon's 
first commutator. This operator can be written as a superposition of bilinear singular 
integral operators of the form 

Taifi, f2)ix) = p.v. I fi{x -t)f2{x - at)'^ 
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with parameter a ^ {0, 1}, called bilinear Hilbert transforms. One of Calderon's early 
attempts to bound his commutator was to show the boundedness of the bilinear Hilbert 
transforms from x to L^. However, he gave up on this approach and proved 
bounds on the commutator by different means |2], [2]. 

The bounds for the bilinear Hilbert transform conjectured by Calderon remained an 
open problem for more than 30 years. M. Lacey and C. Thiele proved [I9],[20], that 
the bilinear Hilbert transforms are bounded from LP^ x LP'^ to U for 1 < pi,p2 < 
oo, = + ^'^d 2/3 < p < oo. Appropriate control on the growth of the 
constants associated with these bounds as a approaches the forbidden values {0, 1}, was 
established in |29j. This step was necessary in order to complete Calderon's program 
of estimating the commutator as superposition of bilinear Hilbert transforms. Thiele's 
results were strengthened to uniform bounds in some range of exponents by L. Grafakos 
and X. Li [H], [21]. 

The main feature that distinguishes the bilinear Hilbert Transform from its classical 
linear counterpart is the fact that the former has modulation invariance. A similar 
property is shared by Carleson's maximal operator, which controls convergence of the 
Fourier series, The resolution of both problems resides in the representation of 
these operators in a wave packet frame that is itself invariant under modulation. 

The natural question regarding bounds on more general bilinear singular integral 
operators than the bilinear Hilbert transform, where the kernel 1/t is replaced by more 
general Calderon- Zygmund kernels was first addressed by J. Gilbert and A. Nahmod 
(see [H], p5], [16]). They proved bounds for the class of kernels K{t) which are x- 
independent. Uniform bounds in a for these operators were then shown by C. Muscalu, 
T. Tao and C. Thiele ([24j). 

The purpose of the current article is to address the case of kernels K{x, t) that have 
both t and x dependence; this corresponds to the non-convolution case in the classical 
linear theory. The results we obtain, in particular Theorem 12.71 are different in nature 
from the bilinear or multilinear T(l) theorems of M. Christ and J.L. Journe [5] and 
L. Grafakos and R. Torres [18] since, as we shall soon describe, we treat operators 
associated with far more singular kernels. The main new feature that distinguishes the 
operators we analyze is that they have modulation invariance in a certain direction. 
We seek a theory for them analogous to the one involved in the classical T(l) Theo- 
rem. We will have to incorporate, however, time-frequency techniques that reflect the 
modulation invariance of the bilinear operators treated. 
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supported in part by NSF under grants DMS-0556389, DMS 0503542, DMS 0400879, 
and DMS 0400423, respectively. P. Villarroya was supported in part by grant MTM2005- 
08350-C03-03 and EX2004-0510. 
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2. The main theorem, applications, and road map of the proof 

2.1. Modulation invariant bilinear T(l) Theorem. 
We start with a few definitions and examples that will lead us into formulating the clas- 
sical (linear) T(l) Theorem in its dual version and its bilinear counterpart. Throughout 
the whole paper we will restrict our attention to the one dimensional case. 

Definition 2.1. A function : M x (M \ {0}) ^ R is called a C alder on- Zygmund 
kernel if for some < 5 < 1 and some constant Ck we have 

(2.1) \K{x,t)\<CK\tr' 

(2.2) \K{x,t) - K{x',t')\ < CK\\{x,t) - {x',t')\\'\t\-'-' 
whenever \\{x,t) — (x',t')|| < \t\/2, where \\ ■ \\ denotes the euclidian norm. 
Definition 2.2. A bilinear form A, defined on the product of Schwartz spaces 

A : 5(M) X S{M) C 

is said to he associated with a standard C alder on- Zygmund kernel K if for some (3 = 
{^1,^2) and for all Schwartz functions /i,/2 G S{R) whose supports are disjoin^, we 
have 

A(/i,/2)= / fi{x + mf2{x + mK{x,t)dxdt. 

// the form is continuous on 5(]R) x iS(]R) then it will be referred to as a bilinear 
C alder on- Zygmund form. 

The above representation of A is convenient for the formulation of the trilinear forms 
that we will study. Note, however, that the simple change of variables x' := x+[32t, t' : = 
X + (3it gives the more classical representation 



(2.3) A(/i,/2)=/ f,{t')f2{x')K{x\t')dt'dx' 

where 

K( 4^ _ ^ j^( f^ix- f32t t-x ^ 
^"^' ^"1/91-/321^ /3i-/32 

satisfies 

\K{x,t)\ < CK\x-t\-^ 

and 

\kix,t)-kix',t')\ < CKi2+m)'\\{x,t) - ix',t')\\'\x-t\-'-' 

whenever c^||(x,t) - (x',t')ll < \x - t\ and Cfs = 2(1 + \\/3\\) > 2. 

We will use the notation (-I-) to denote the pairing of a distribution with a test 
function, which we take to be linear in both entries. We reserve the notation (-, •) 
to denote the usual Hilbert space inner product of L^, conjugate linear in the second 



""^The requirement that the supports are disjoint is sufRcient -due to (|2.ip - to guarantee the conver- 
gence of the integral; it is also necessary in general, as it is easily seen by working with K{x, t) — \t\^^. 
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entry. With this notation we can associate to A the hnear dual operators T and T*, 
continuous from 5(]R) to 5'(M), given by 

A(/l,/2) = (T(/i)|/2) = (T*(/2)|/i). 

We see from (12. 3p that the Schwartz kernel of T restricted away from the diagonal of 
agrees with the function K, as usually stated for linear Calderon-Zygmund operators. 
When convenient in our computations, and without loss of generality, we will often 
assume this more classical representation (12. 3p for A. 

Definition 2.3. A trilinear form A defined on S(K) x S{R) x 5(M), is said to be 
associated with a standard Calderon-Zygmund kernel K if for some (3 = (/9i, /92, /^a) 
and for all functions fi, f2, fs G iS(]R) such that the intersection of the three supports 
is empty, we have 

(2.4) A(/i,/2,/3)= / l\f,{x + (3,t)K{x,t)dxdt. 

// the form is continuous on S{M.) x S{M.) x S(M.) then it will be referred to as a trilinear 
Calderon-Zygmund form. 

Now the trilinear form A is associated to the bilinear dual operators given by 

A(/l,/2,/3) = (T3(/i,/2)|/3) = (Ti(/2,/3)|/l) = {T,{f^J,)\f,), 

but unlike the bilinear case, K is no longer the restriction of the Schwartz kernel of 
T3. In the sequel, we shall assume that ^1,^2,^3 are pairwise different, as otherwise 
the trilinear form reduces to a combination of a pointwise product and bilinear form. 
If needed, by a simple change of variables and appropriately modifying the constants 
involved in the definition of a Calderon-Zygmund kernel, we can assume (3 to be of unit 
length and perpendicular to a = (1, 1, 1). Let 7 be a unit vector perpendicular to a and 
/?, the sign of 7 being of no importance. The condition that no two components of (3 
are equal is equivalent to no component of 7 being zero. The integral representing A for 
functions with disjoint supports satisfies the modulation symmetry along the direction 
ofl- 

(2.5) A(/i, /2, /3) = A(M,, Ji, M,, ^2, J3) 

for all ^ G M. Here modulation is defined as M^/(a;) = e^'^^^^ f{x). Note, however, 
that the kernel representation does not guarantee the modulation invariance (12.51) for 
arbitrary triples of Schwartz functions /i, /2, f^. 

Definition 2.4. A trilinear Calderon-Zygmund form A associated with a standard 
kernel K for some [3 is said to have modulation symmetry in the direction 7, with 7 
of unit length and perpendicular to the plane generated by P and a, if (12.51) is satisfied 
for all functions fi, f2, fs G 5(M). 
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Let US look at typical examples of these operators given in pseudodifferential form. 
Consider again the bilinear Hilbert transform 

T(/i,/2)(x)=p.v. / f,{x-t)Mx + t)-, 

or equivalently 

T(/i,/2)(x)= / sign(e-r^)/i(0/2(^)e''^"-(«+'')rferfr/. 
More generally, one can consider operators of the form 

where m is a multiplier satisfying the classical conditions 

|m(")(M)| < Cl^r", n<N. 

Undoing the Fourier transforms of /i, /2, one arrives to the kernel representation of T, 
namely 

T(/i,/2)(x)=p.v. / f,ix-t)f2{x + t)K{t)dt, 
Jm. 

where X is a classical CZ kernel of convolution type and K(u) = m{u). These bilinear 
operators fall under the scope of the more general boundedness results stated in [H] 
and 

To introduce x-dependent kernels consider now bilinear operators of the form 

JR2 

where a{x,u) is a symbol in the Hormander class S'°q, so that 

\d^^dl^a{x,i-^)\<C^{l + \i-^\r\-\. 

Then, undoing again the Fourier transforms, we arrive to the following integral repre- 
sentation of T valid at least for functions with disjoint support: 

T(/i,/2)(x)= / K{x,x-y)5{z-2x + y)fMf2{z)dydz 

= I K{x,t)fi{x-t)f2ix + t)dt, 
Jr 

where K{x, x — y) = (JF^^a) {x, x — y) and the inverse Fourier transform is taken in the 
second variable. It is well-known that such a i^' is a Calderon-Zygmund kernel (and 
with 6=1). These bilinear operators give rise then to trilinear forms of the type (12. 4p 
with (3 = (-1,1,0). 

In the previous example, the Schwartz kernel of the bilinear operator T is given by 
k{x,y,z) = K{x,x — y)S{z — 2x + y) and hence it is too singular to fall under the 
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scope of other multilinear T(l) theorems in [5j and [18j, which essentially apply to 
pseudo differential operators of the form 



T{fi,f2){x) 



where a{x,C,,ri) satisfies the classical Coifman-Meyer estimates 

|9^f9^>(x,e,r/)|<C,(l + |e| + |r7|)-l"l 
so that the (restricted) Schwartz kernels satisfy 

< Ca,{\x-y\ + \y - z\ + |^ - 



Definition 2.5. An U'- normalized hump function 
of order N to a box I := Ii x . . . x 1^ if 



C is said to be C- adapted 



-l/p-a^, N 



m'=l 



for each < |a| < A^. 
We will use the notation 



1 + 



Xi - c(/i) 

\h\ 



-1/2 



with c(/) denoting the center of the interval /. Often times we will simply call a function 
W- adapted to / (or LP- adapted to I of some order iVo), if it is L^- normalized and 
Cat- adapted of each order N (or of order A^o); for some whose value will not be 
specified. When no normalization will be mentioned for a bump, it will be implicitly 
understood that the normalization is taken in L^. 

The implicit bounds hidden in the notation a < b that we shall use, will be allowed 
to depend on the constants of adaptation and on fixed parameters like 7, 5, a or Ck- 
The notation A ^ B will mean that A < B and B < A. 

Before we state our main result we recall the classical T{1) theorem in a form useful 
for our purposes. 

Theorem 2.6 (Linear T(l) theorem). Assume A is a C alder on- Zygmund form on 
(S(M) X iS(]R). Then A extends to a hounded bilinear form on L^(]R) x L^(]R) if and only 
if there exists an N such that 

(2.6) 



|A(0/,/)|< 



|A(/,0/)|<||J||2 

for any interval I, any L^- adapted bump function (pj of order N which is supported 
in I and any Schwartz function f. If these equivalent conditions are satisfied, then the 
bilinear form extends also to a bounded form on L^(M) x (M) for 1 < p < 00 and 
+ = 1. 
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The condition (12.61) and its symmetric form are called the restricted boundedness 
conditions. In Lemma 13.11 we will see a slightly stronger result, namely that it is 
sufficient to test the restricted boundedness condition only for those / supported in /. 

Remark 2.1. There are a few other equivalent formulations of the T(l) theorem. We 
just recall, as mentioned in the introduction, that the boundedness ofT is also equivalent 
to the weak boudedness property (that is A{ipi, ipi) < 1 for all tpi which are L^- adapted 
to I) together with the fact that some appropriately defined functions A(l, .), A(., 1) are 
in BMO. 

We now state our new result for trilinear forms. 

Theorem 2.7 (Main theorem). Assume A is a trilinear Calderdn-Zygmund form on 
5(]R) X iS(]R) X iS(]R) associated with a kernel K with parameter 5, and with modulation 
symmetry (12.51) in the direction of j (with 'ji 0). 
Then A extends to a bounded form 



|A(/i,/2,/3)| <nil/: 



for all exponents 2 < Pi,P2,P3 < oo with 

111. 
— + — + — = 1 

Pi P2 P3 

if and only if there is ar^ such that the following three estimates hold for all intervals 
I, all L^- adapted functions 0/ and ipi of order N which are also supported in I , and 
all Schwartz functions f 



(2.7) |A(</),,^,,/)|<|/|-i/2 

|A(0/,/,^/)l<|/r'/' 

iA(/,0/,^.)i<i/r^/^ 



Moreover, if these equivalent conditions hold, then for aj = 1 the following holds: 
IfO<ai< min(l/2 + S,1) fori <i < 3, then 

3 

iA(/i,/2,/3)i<nii/^iiiM • 

i=l 

//max(— 5, —1/2) < < for only one index j and < < min(l/2 + 6, 1) for the 
other two indices, then the dual operator Tj satisfies 

\\Tj{{fi)i^j)\\l/il~aj) < Y\. 



^As in the case of the classical T(l) Theorem, the value of N is not important. Once the theo- 
rem holds for some N, it also holds for any larger value. This observation will be used repeatedly 
throughout the argument. 
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To summarize, there is an a priori estimate for the form A and a tuple a of reciprocals 
of exponents provided that J2j = 1 ^^^1 max(— 5, —1/2) < aj < min(l/2+5, 1) for all 
j. Interestingly, the range of exponents a for which the theorem guarantees bounded- 
ness is the same for each 5 e 1]? while the range shrinks for 5 < |, as 5 approaches 0. 
We do not know if this range is optimal. Note also that for 5 > 1/2 in Theorem 12.71 we 
recover the same range in which the bilinear Hilbert transform is known to be bounded. 

It is worthwhile noting that the necessity of condition (12.71) and its symmetric coun- 
terparts is clear, as such conditions follow from the claimed estimates applied to special 
test functions. We call these conditions the (trilinear) restricted houndedness condi- 
tions. One can see that it is also enough to test the conditions for C°° functions 
/ supported in an interval containing / of length C|/|, where C > is a universal 
constant. See Lemma [3.31 below for more details. 

Note also that, formally, if A is a trilinear Calderon-Zygmund form satisfying the 
conditions of the main theorem, then 

A(/i,/2,l) 

is a bilinear Calderon-Zygmund form satisfying the conditions of the classical T(l) 
Theorem. We will make this reduction to bilinear forms rigorous in the next section. 
However, not every bounded bilinear Calderon-Zygmund form can be obtained this 
way, since for the trilinear form to be bounded more conditions need to be satisfied. 
We will see concrete examples in Section [71 

2.2. An application of the main theorem. 

Of course, the relevance of Theorem 12.71 is that it applies to operators with x- 
dependent kernels. We present one application to the bilinear pseudodifferential oper- 
ators mentioned before. 



Consider again the trilinear form 

A(/l,/2,/3)= / n{hj2){x)Mx)dx 



R 



a 



x,^- v)M)h{v)h{x)e'^'^^^^''^did7^dx 



with a in S^q. Note that this form has modulation symmetry in the direction 7 = 

(1,1,— 2)/v^ for all triples /i, /2, /a, not just the ones with disjoint supports. To 
check the first of the restricted bounded conditions we may assume / is supported in 
CI and compute 

iA(0/,^/,/)i < mwmwmw < \\ML4MLAin\\L^ 

<i/r^/'ii/iu- 

Here, we used that (pj and ipi are L^-normalized and adapted to intervals of length 
|/|"^ More precisely, 0/ can be written as 0/(x) = \I\~'^^'^(l)o{{x — xo)/\I\) where 0o is 
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adapted to and supported in the unit interval centered at the origin. It follows easily 
now that 

mw = \i\-'''\\UL^<c\i\-'^ 

where C depends only on finitely many derivatives of 0o- The same estimate applies 
to ifji. To obtain the other restricted boundedness conditions, write 

A(0,,/,V7)= ! T,{(Pj,ijj){x)f{x)dx 
Jm. 

and 

Hf,4>i,i^i)= / Ti{(j)i,ipi){x)f{x)dx. 
Jm. 

It was proved by A. Benyi, A. Nahmod and R. Torres [1], that Ti and T2 can be 
computed from a and they admit pseudodifferential representations of the form|^ 

T2{f,g){x)= [ a,{x,i,n)mg{r^)e'-^<^+^^didr^ 

and 

T,U,g){x)= [ a^{x,^,r^)mme'''''^^'"'^d^dri 
where a2{x,$,,ri) and cri(x, ^,77) satisfy 

\d'.dl^a2{x,^,v)\<il + m + v\r^^^ 

and 

|5^;5?>i(^,e,r])i <(i + ie + 2r7i)-'"|. 

The computations done with T3 can now be repeated with T2 and Ti. It follows that 
A and the Tj have the boundedness properties of Theorem 12.71 with 5 = 1. 

Similar examples of forms can be obtained by starting with a bilinear operator 
T3 given by a symbol of the form ag{x,^,r]) = a{x,rj — ^tan6') for a in S^q and 
6 7^ — 7r/4, , 0, 7r/2 (we make the convention cr7r/2(a;, ■C? "'?) = ^{^lO)- the three 
forbidden cases when 9 = —tc/A, , 0, 7r/2 the trilinear forms correspond again to a com- 
bination of a pointwise product and a bilinear form. See |lj for more details. 

2.3. Plan of the proof. 

The rest of this article is structured as follows. In Section [31 after some basic reduc- 
tions, we present some equivalent formulations of the main theorem. In particular, in 
analogy to the classical T(l) theorem, we give a meaning to the functions Tj(l, 1) for 
j = 1, 2, 3 (see Lemma [3171 below) and show that the restricted boundedness conditions 
imply that these functions are in BMO. We also observe that the restricted bounded- 
ness conditions imply a certain weaker one (13.81) . which together with the conditions 
Tj(l, 1) G BMO is all what will be used to prove the main theorem. Hence this set of 
conditions is also necessary and sufficient to obtain the bounds on the trilinear form. 



3 Ti(/,3) = T*\gJ) and T2{f,g) = T*\f,g) in the notation of [IJ. 
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In Section H] we establish some bounds on the action of trihnear Caldeon-Zygmund 
forms satisfying the alluded weak continuity fl3.8p and some special cancellation condi- 
tions on bumps functions. These are almost orthogonality type conditions. The proof 
of the Theorem 12 . 71 then splits into two steps. First, one proves the theorem under the 
special cancellation condition that Tj[l,l) = for all j. This step is done in Sections 
[5] and El In the former the problem is reduced to a time-frequency model form, which 
is then estimated in the latter. 

The second step in the proof of the theorem is to construct for each given BMO 
function b, forms Aj, j = 1,2,3, which are associated to Calderon-Zygmund kernels, 
have a given modulation symmetry, satisfy the bounds of the theorem, and are such 
that the corresponding dual operators satisfy Tj(l, 1) = 6 and Tj(l, 1) = for i 7^ j By 
analogy again with the classical T(l) Theorem, we call these special forms modulation 
invariant paraproducts. Then, Theorem 12.71 can be always reduced to the case with 
special cancellation by subtracting from the original form A three paraproducts with 
the same modulation symmetry. The paraproducts are discussed in Section [71 

3. Alternative formulations and the role of BMO 

We begin this discussion by a lemma that implies the strengthening of the classical 
Theorem 12.61 that was mentioned after the statement of the theorem. This will be used 
in the proof of Lemma 13. 3[ 

From now on, for each box / in M" and each R> 0, RI will denote the box with the 
same center as / and sidelengths R times larger than those of /. 

Lemma 3.1. Assume A is a continuous bilinear form onS{M.) xS(R) that is associated 
with a standard Calderon-Zygmund kernel K. Assume there is an N such that the 
restricted boundedness condition 

mij)\<co\\fh 

holds for all bump functions 0/ of order N adapted to and supported in I and all 
Schwartz functions f supported in I. Then, 

\A{<Pi,f)\<C\\fh 

(with a possibly larger constant C) also holds for all bump functions (pj of order N 
adapted to and supported in I and all Schwartz functions f (not necessarily supported 
in I). 

Proof. Without loss of generality, we may assume that / is centered at the origin. 
Also, by the continuity of A on iS(]R) x 5(]R), it is enough to prove the conclusion 
for all / in iS with compact support. Consider first the case when 0/ has mean zero. 
Let be a large constant chosen later depending on j3. Decompose now in a smooth 
way f = fi + /2, with /i supported in 2RI and /2 supported outside RI. Since 
(2i?)~(^/^+^^0/ is adapted to and supported in 2RI, the hypotheses of the Lemma give 
then 

|A(0/,/i)|<Co(2i?)(V2+A^)||/|h. 
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To estimate A(0/, f2) we use the kernel representation with the change of variables 
explained in the introduction to get 



A(0/,/2) = J (t)i{u)f2{v)K{u,v)dudv, 

where K satisfies 

\k{u, v) - k{u', v') I < Cp,,p, \u - v\-^-'\\ (w, v) - v')\\' 

for c^IKm,^) — (m',^')!! < |m — v| and cp > 2. The domain of integration is given by 
\u\ < \I\/2 and |^;| > -R|i"|/2, which imply that \u - v\ > {R - l)|/|/2 > Cfi\u\ if R is 
large enough. Thus, using the mean zero of 0/ we can write 

A(0/,/2) = j Mu)f2{v){k{u,v)-K{0,v))dudv 

and obtain 

\Hh,f2)\<Cp,,p,J \Mu)mv)\\uf\u-v\-^'+'Uudv 

<C^,a\I\' [ \Mu)\\f2{v)\\u-v\-('+'Uudv 

J \u—v\>c\I\ 

< Cp„pM\M2\\f2\\2 [ dx < Cp„M\2- 

J\x\>c\I\ 

This proves 

mI,f)\<C^\\fh 

for an appropriate constant Ci under the additional assumption that 0/ has mean zero. 

To treat the general case, define (p2ji{x) = 2~-'/^0/(2~%) for j > and observe that 
2-{i/2+N)(^^^.^ — 2~^/^02J+i7) is adapted to and supported in 2-'+^7 and has mean zero. 
Let now k >Ohe the smallest integer such that the support of / is contained in 
and write 

k 

H<Pi, /) = E 2"'/'A(02.7 - 2-^/V2.+i7, /) + 2-('=+i)/2A(02^+i„ /). 

j=0 

In the first k + 1 terms the bumps have mean zero, while the last term can be controlled 
by the hypothesis. Thus, 

k 

|A(0,,/)| <(Ci2WE2-^/2 + Co2-('=+i)/2)||/||2<C||/||2, 

j=0 

where C is independent of fc. □ 

We continue by studying the relationship between bilinear and trilinear forms. By 
the Schwartz kernel theorem, the trilinear form A can be represented by a tempered 
distribution in R^, which we shall also denote by A, so that 

A((/)i, 02, 03) = A(0i (8) 02 03)- 
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In this way, A(0) has a meaning for any cj) G 5(]R^) not necessarily a tensor product. 

Moreover, the modulation invariance of A implies that the distribution is supported 
on the orthogonal complement of 7. Even stronger, it is given by a two dimensional 
distribution A^. applied to the restriction : 7"*" — ^ C of the test function to 7-*". 
This can be seen as follows. Consider first a function / in C^(]R^) and pick another 
function Lp in C^(M^) with = 1 on a neighborhood of the support of /. By the 
linearity, continuity, and modulation invariance of A, and writing ^ = ^7 + ^* with ^* 
in 7-*-, we get 



A(/) = A(^/) = A( / ^{x)e^^^^^f{m) = A( / / ^{x)e'-'^^' e'^'^^^ f {t^ + C)dtdC) 

J JR J-y^ 

A((^(x)e2'^"^*e'™«*)/(t7 + r)dtrfr = / / A(v9(x)e2"*"«*)/(t7 + Ddtrff 




JR Jy^ JR3 

where = f{x*). (Note that (pf^, is still in C^(]R'^)). In particular, if two functions 
/i and /2 in C^(]R^) agree on 7-^, then A(/i — = 0. Also, since C^(]R^) is dense 
in iS(]R^), a simple limiting argument shows that A(/) = for all / G iS(]R^) with 
supp / n 7"*" = and so supp A is contained in 7-*-. 

Now, let ip be in C^(]R), ip = 1 in a neighborhood of zero, and define ipi^) = 
ijj{t'j + X*) = ip{t). We have already seen that for / G C^(]R^), A(/) depends only on 
/l^i so 

A(/) = A(V^/) = A(^/.) 

and a limiting argument gives the same result for / G i5(]R'^). This also allows to define 
a distribution in 5' (7-*-) by 

for all g G 5(7-'"), where ^ : — > C satisfies g{t'~f + x*) = g{x*). The definition is 
clearly independent of the choice of ip and we have 

A(/)=A,(/|^,). 

Clearly, by a density argument, the kernel representation of A continues to hold 
when the test function (p is no longer a tensor product but still has support disjoint 
from the span of (1, 1, 1). 

Moreover, 

and the integral is absolutely convergent as long as function cj) just vanishes on the span 
of (1,1,1), but provided that A satisfies a weak boundedness property (13. 8p below, 
which is implied by the restricted boundedness conditions. To verify this, the reader 
may adapt to the case of trilinear forms the arguments used in, for example, [SU] for 
bilinear ones. 



MODULATION INVARIANT BILINEAR T(l) THEOREM 13 

There are three distinct bihnear forms that we can consider now 

Al(02,03) = A*((l®02®03)|-,i) 

A2(01,03) = A,((0i®l®03)|^x) 
A3(01,02) = A,((0i®02®l)|^x) 

Observe that the functions on the right hand side are in iS(7"'"), since none of the 
components of 7 is zero. Moreover in case of disjointly supported functions 02, 03 we 
obtain a kernel representation for Ai with the same kernel K. In fact, since ■0(1 ® 02 ® 
03) e <S(M^) and obviously has the same values as -0(1 ® 02 ® 03)* on 7^, 

Al(02,03) = A,((l®02®03)|^i) = A(^(l®02®03)*) = A(^( 1 ® 02 ® 03) ) 

ip{xa+t[3)(t)2{x+l32t)(t)z{.x+[3'j,t)K{x, i) dxdt = j 02(a;+/?2^)03(a^+/?3^)-f^(a;, i) dxdt. 
Similarly for A2 and A3. 

In the sequel we will use the following notation. 

Definition 3.2. For a > 0, f G M'^ the -normalized dilation operator and the 
translation operator are defined via 

Dlfix) = a-'^/^fia-'x), rJix) = fix - v), 

for all functions f defined on M'^. We will also sometimes write 

DJix) = D^fix) = f{a~'x). 

Lemma 3.3. The restricted houndedness conditions for A are equivalent to the re- 
stricted boundedness conditions for the Aj 's, if one is willing to have a loss in the order 
of the bump functions and the constants involved in defining restricted boundedness. 

Proof. We assume first that A satisfies the restricted boundedness conditions, and prove 
restricted boundedness of Ai (A2 and A3 can be done analogously). Let $ : M ^ [0, 1] 
be smooth with 

. . $(x) = l |a;| < 1 

^ ' $(a;) = |a;| > 2. 

It is easy to check that if / is supported in the interval / (Lemma 13.11 allows us to 
restrict attention to this case) then 

(1 ® 0/ ® /) l^x = (r,(,)Dc;|/|$ ® 0/ ® /) l^x , 

where C is a sufficiently large constant depending only on 7. To obtain restricted 
boundedness for Ai, we simply apply the restricted boundedness of A for the enlarged 
interval CI. 

For the converse, we first show, as claimed in the introduction, that it is enough to 
show the restricted boundedness estimates for A(0/ ® V'/ ® /) fo^' every C°° function 
/ supported in CI, where C is a universal constant depending on 7. To see this, 



14 A. BENYI, C. DEMETER, A.R. NAHMOD, CM. THIELE, R.H. TORRES, AND R VILLARROYA 



and without loss of generality, we may assume / centered at the origin. The region 
\x + Pitl < \I\/2 and \x + ^2^1 < 1-^1/2 is a parallelogram centered at the origin in the 
{x,t) while \x + jS^tl > C\I\/2 is the complement of a strip along the line x + jS^t = 0. 
Recall that the components of (3 are pairwise distinct, so if C is large enough depending 
only on j3, then the two regions do not intersect. Decompose now in a smooth way 
/ = fci + if — fci), where fci coincides with / on CI, is zero outside 2CI and 
satisfies H/c/lh < 2||//||2 Note that 4>i ^ ipi ^ {f — fci){xa + = 0, so we must have 
A(0/ ® tpi ® {f — fci)) = 0, and the claim follows. 

Assume now that all the Aj satisfy restricted boundedness conditions of order N . For 
simplicity of notation also assume that I is of length 1/C and centered at the origin and 
that 0/, ijji are bump functions adapted of order N' ^ N and supported in /. Let / be 
a smooth function supported in CI. Let g be adapted of order N' and supported on 
< 1/(2C) < 1/2, |t>| < 1/2} and so that g{u, v)ipi{u)f{v) = ipi{u)f{y) on the same 
region. Write v) = 7^^(72^* + 73^)) and note that (pg is adapted of order A^' to 
[-1/2, 1/2] X [-1/2, 1/2]. Expanding (pg in Fourier series on [-1/2, 1/2] x [-1/2, 1/2] 
we get 

+ Pit)Mx + p2t)f{x + Pst) 

= 0(x + /32t, X + /33t)g{x + /32t, x + I33t)^i{x + P2t)f{x + P^t) 



m2,m3 



with 

Now, the function 



< (1 + max(|m2|, ImgD) 
;i + \m2\)-^^i{x)e^''''^^'' 



is a bump function of order A^ adapted to the interval /. The factor (1 + \m2\)~^ is 
needed to offset the loss of powers of m2 when taking derivatives of the exponential 
factor. Applying the restricted boundedness of Ai proves that 

K{(t)i®^i® f)= 5^ c„2,m3(l + l"^2|)'^Ai((l + |m2|)-^M„,V',,M^3/) < 



"12, ma 

□ 

Assuming Theorem 12.71 is true, we have thereby seen the following corollary of The- 
orems 12.61 and 12.71 

Theorem 3.4. Let A be a trilinear C alder on- Zygmund form with modulation symmetry 
in direction 7. Then A extends to hounded trilinear form with exponents as in Theorem 
2.7\ if and only if Aj for i = 1,2,3 are bounded in L"^ x L'^. 



The lemma below will allow us to state the restricted boundedness property in a 
slightly more general way. We need to consider bumps which may no longer be com- 
pactly supported but are still concentrated around appropriate intervals. 
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Lemma 3.5. Assume A is a Calderon-Zygmund trilin ear form with modulation sym- 
metry in the direction 7 and that satisfies the restricted boundedness conditions of 
Theorem 2.1 for some N. Then for all intervals I, all L^- normalized bump functions 
4>ixi L"^- adapted to I x I of order N' » N and all functions f G 5(]R) we have the 
estimate 

iA(0/x/®/)i<i/r^/'i 



and the symmetric inequalities. 

Proof. For simplicity of notation we shall assume / is centered at the origin and of 
length 1. We take a lacunary decomposition related to /: let $ be as in fl3.1l) and 
define 

(j)k = ($fe ® - ® $fe_l)0/x/ 

for k > 0, where = Note that the functions (pk add up to so by using 

the continuity of A in 5(]R^) we obtain 

(3.2) A(0/x/®/) = 5^A(0fc®/). 

A;>0 

Note also that 2^^^^^ is L^-adapted to and supported in 2'^+^(/ x /). We write 

where (pk means the periodization of (pk from the square 2^+^(1 x /). By performing a 
windowed Fourier series, we obtain 

where the coefficients Ck^mi,m2 are rapidly decaying in the sense that 

(3.3) |cfc,^,,„J < 2-5^(1 + max(|mi|,|m2|))-2^-^ 

Denoting by ^k,mix) = $fc(x)e^'^*^~^*^^''^'^ we have that the functions \m\^'^2~''/'^^k,m 
are L^- normalized, adapted of order and supported in 2^~^'^I, uniformly in m and 
k. 

By (13.21) we have 
|A((/)®/)| < ^ ^ |cfc 

,mi,m2 

||A(<l>fc,^,,<l>fc,^,,/)| 

fc>0 mi,m2GZ 
fc>0 mi,m2£Z 

Now the estimate of the lemma follows by applying fl3.3p and the restricted boundedness 
condition to each summand on the right hand side with interval 2^"*"^/. □ 
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We will now give a rigorous definition of Tj(l, 1) as a distribution modulo constants 
and prove that the restricted boundedness property implies that they are elements 
of BMO. The approach is similar to the linear case and we will follow some of the 
arguments in [27]. See also [30] and [18] for similar linear and multilinear definitions. 
We start with the following lemma. 

Lemma 3.6. Assume that A is a trilinear C alder on- Zygmund form with modulation 
symmetry in the direction 7. Let $ he as in (13. ip . For every function f with mean 
zero the limit 

(3.4) L{f) = lim AiD^.i^ ® $) ® /) = Km {^{0^,^, D^,^)\f) 

exists. Moreover, z/supp/ C (—2*^", 2*^"), then for sufficiently large k (depending on ko 
and j) we have the error bound 

(3.5) \L{f) - A{D^,{^ ® $) ® /)| < 2-^(^-'=°)||/|Ui 

where 6 is the parameter in the C alder on- Zygmund property of the kernel K and the 
implicit constant is independent of ko, k and f . 

Proof Assume supp/ C (—2'^", 2^°). For all j > k^ ko write 

j-k 

D^,{^ ® $) - Z}^(<l> ® $) = ^D2°f+.($ ® $) - L'^+.-i($ ® $) 

1=1 

and let = D^+^i^ ® $) - D^+^-ii^ ® $)• We will estimate \A{ipi ® f)\ to prove 
that the sequence A(D^($ ® $) (S> /) is Cauchy, as well as to estimate the error bound. 
Since the support oi ipi® f is disjoint from the span of (1, 1, 1), we can use the kernel 
representation of A 

A(V'« ®f) = j + Pli, X + p2t)f{x + p3t)K{x, t) dxdt. 

Similarly to what we did in Section 12.11 for bilinear forms, a simple change of variables 
allows us to write the above integral in the form 



ipii^u, v)f{w)K{u, w) dudw 

where v = — 72'^(7i'U + 73W) and K satisfies the classical Calderon-Zygmund estimates 
(3.6) \K{u,w)\<\u-wr\ 



(3.7) \K{u,w) - K{u',w')\ < \u~w\-^^+^'>\\{u,w) - {u',w 

for C/3II {u, w) — {u', w') II < \u — w\. 
On the support of ipi, 

2^+^'^ < \\{u,v)\\ < 2'+^'+^ 
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and for w G supp/, \w\ < 2'"'. Note that if \u\ < (1 + |72"Si|)""^2''+'"^ and k > fco, 
then 

\\iu,v)\\ < \u\il + |72'Si|) + |72"S|2'« < 2^=+'-^ 
Thus, in the integral representation of A{ipi ® /) we may assume 

2''+' < Inl < 2^=+'+^ 

I I — 

Using the mean zero property of / we obtain 

Mi^i ® f) = j H'liu, v)k{u, w) - ipi{u, -'j2^'jiu)k{u, 0)]/(w) dwdu. 

We now write the term in square brackets as 

iipiiu,v) - ipi{u,Vo))k{u,w) + i)i{u,Vo){k{u,w) - k{u,0)) 

and estimate each term by its supremum norm on the domain of integration. Clearly, 
we have {ipil < 2. As the derivative of ipi is 0(2~'^~'), we have 

\M^^,v) - Mu,vo)\ <2-'-'\w\ <2-''-'2''^\ 

For K we have from the Calderon-Zygmund estimates 

\k{u,w)\ <\u- w\-' < {2'+' - 2^«)-i < 2-'-^ 

and 

\k{u,w) - k{u,o)\ < \u\-^^-^^^\w\^ <2^-^-^^^^+^h''"\ 

Hence we can estimate 

IHi^i ®f)\< (2-2(^+02^0 + 2-('=+0(i+5)2'=o^) [ [ \f{w)\ dudw 

Summing in / finishes the proof of the lemma. □ 

Lemma 3.7. Assume that A satisfies the restricted houndedness conditions. Then the 
linear functional given by Iji3.4\ ) can be extended to all the Hardy space and defines 
an element of BMO that we will denote by T^{1, 1). 

Proof. We will show first that the functions T^{D^^, -D^$) are uniformly bounded in 
BMO. To simplify the notation, let 0^ = ® and let large constant 

(depending only on 7) whose value will become clear later. We will show that the mean 
oscillation of T^{(t)k) on some arbitrary interval / is 0(1). 
If I is an interval with c^|/| > 2^^, then by Lemma [3.51 

||T3(0fc)|U^(/)< ||T3(0,)|U.<2'=/2<|/|i^ 

which implies that the mean oscillation of T^{(j)k) on I is 0(1). 
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Assume next that / is an interval centered at the point c(/) and such that c^|/| < 2^. 
Let M be the smallest integer so that [— 2^^, 2^] C c^2^ I and write 

M-1 
1=0 

where 
and 

By Lemma 13.51 , for any function in / in we can write (with a small abuse of 
notation) 

M-1 M-1 

(T3(0fc)|/) = A(0fc ®f) = Y^ h{i^k,i,i ®f) = J2{n{^k,i,i)\f) 

1=0 1=0 

Since c^\I\ < 2^^, |/|~^'?/'/c,/,o is L^- adapted to / x / and we can estimate again using 
Lemma 13.51 

iit3(^m,o)iil^(/)< i/r^', 

which implies that the mean oscillation of T^{'ipk,ifl) on / is 0(1). In the other terms 
T3{ipk,i,i) can be represented on I by absolutely convergent integrals which, after the 
usual change of coordinates, take the form 

T3{tpk,i,i){w) = j tpk,i,i{u,v)k{u,w) du, 

with t> = — 72^"'^(7iM+73ti'). Observing that t> — c(/) = — 7^"*^ (71 (u — c(/)) +73(^ — 0(1))), 
we may proceed as in the previous lemma to verify that the above integral may be 
restricted to the region where 

c'^c^2^\I\ < \u~c{I)\ < c^2'+Vl 

for some small constant depending only on 7. We take now large enough so that 

\u - c(/)| > max{10 x 2',c^}|J|, 

where C/3 is associated with K as in Lemma [3l6l 
Define Vq = —7^7"^ (71M + 730(1)) and the constant 



Ck,i,i = J ipk,i,i{'^^'^o)K{u,c{I))du 

and write 

\T3{i'k,i,i){w) - Ckj,i\ < 
\ipk,i,i{'^^'^) - V^fc,/,/(M,t^o)| \K{u,w)\du+ / lipkjA'^^'^o)] \K{u,w) - K{u,c{I)\du. 
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Take now some w E I. We can apply (13.71) to obtain that the second integral above 
is bounded (up to a multiplicative constant) by 

|/|V-c(/)r^'+^^ du<2-'^. 

'|«-c{/)|>10x2'|/| 

To estimate the first integral note that 

ll^lloo< ■ , 9^V l^-^o|<|/|, \k{u,w)\<{2^\I\r\ 

ov min(c^2'|i I, z'') 

and the region of integration has length 0(min(cy2'|/|, 2^^)). By putting these things 
together, the first integral above is easily seen to be 0(2~'). This proves that the mean 
oscillation of T^{^k,i,i) oygi the interval / is 0{2~^^). Finally, by the triangle inequality 
we conclude that \\Tz{(t)k)\\BMO < 1- 

To conclude the proof we can now use the - BMO duality. In fact, since the 
unit ball of the dual of a Banach space is weak*-compact, we can extract a subsequence 
of r3(0fcj so that lim,-^oo(T3(0fc,), /) = (T3(l, 1)|/) for some r3(l, 1) G BMO and all 
/ e H^. However, the previous lemma shows that the sequence on the left converges 
to L{f) for C°° functions / with compact support and mean zero. Since such functions 
are dense in H^, it follows that the continuous functional induced by T3(l, 1) on i7^(M) 
extends L to all and that T3(l, 1) is the unique limit (in BMO) of the sequence 

T3(0fc). □ 

The following result shows that the definition of T^{1, 1) in Lemma [3^ is independent 
of the choice of function $ in a very general sense. 

Lemma 3.8. Let f he a Schwartz function supported on I = [—1/2,1/2] and with 
mean zero, and k > 0. Then, for every L°° -normalized bump function adapted to the 
square [— 2'^,2'^] x [—2^,2^] with (p = 1 in a neighborhood o/(0,0) we have the estimate 



\{n{i,i)\f)-A{^,f)\<a,s'2-''' 



LI, 



where we can take S' = 6 if 6 < 1 and 6' = 1^ if 6 = 1. 

Moreover, if A satisfies the restricted boundedness conditions then the estimate above 
still holds with \\f\\i replaced by \\f\\2 (note = 1), if the L°° -normalized bump 

function (p adapted to the square [— 2'^,2*'] x [— 2^,2^^] satisfies only 0(0,0) = 1. 

Proof. Let $ be as before. Considering ipk = -D^($ ® $) — 0, we need to prove 

\A{^k®f)\<2-''. 

Let 

For a large ko and k ^ ko, write 

oo k—ko 
1=0 1=1 
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where 
and 

We can apply the reasoning of Lemma [3l6] to '^k±i to write K{^k±i ® f) as the sum of 
two integrals, one involving {'V'^k±i)K and the other involving j^j-iCV K) , and where 
the integration in u takes place for \u\ ~ 2^^^'. We then need to estimate |V\l/fc±/| and 
and combine them with the Calderon-Zygmund estimates on K (which can be 
applied if ko is chosen large enough depending only on 7). 

For we can use in the first integral that ipk is adapted to a square of side length 
2*^ to get 

< (2-('=+0||^,|U^ + ||$,+,|Uo.2-'=(l + ^)-^ < 2-('=+') 

and in the second integral we simply use that that "^k+i is bounded. Together with the 
Calderon-Zygmund estimates this leads to estimates on the integrals of order 2"'^'^"'"'^ 
and 2"^*^^^-*'', respectively, which sum in / to the correct bound. On the other hand, 
since 0(0, 0) = 1, we can use for "^k-i in the first integral 

|V*fc_z| < |V$fc_/||V^fc| + \<^k-i\Wk\ 

< 2-('=-02-*^2(^'"') + ||$fc_;||Loo2"'' < 2"^ 

and 

l^fc i\ < 2-^2^^-^^ < 

I /t — t I 

in the second integral, which leads to estimates of the order of 2"^^ and 2^'^^2^''^^~^\ 
respectively. Noting that there are less than k terms in this case, we get to an estimate 
of the form 2"'^'^ with 6' as in the statement. 

Finally, note that the last term, A{'9ko ® /) is zero if = 1 in a neighborhood of 
(0, 0) or otherwise it can be controlled by the restricted boundedness property since 
^fcg is adapted to a cube of side length 2''° with a constant of the order of 2"*"'. □ 

Remark 3.1. For any fixed y and $ as in (13.11) . the function T(^y^y)D'^{^ ® $) is 
an L°° -normalized bump function adapted to [—2^,2^] x [— 2'^,2^] and equal to 1 in a 
neighborhood of the origin, for sufficiently large k. Hence we may also write 

A(l, 1, /) = (T3(l, 1)1/) = hm A(r(,,,)D2=^($ »$),/). 

fe— >oo 

Moreover, an examination of the proof of Lemma \3.b\ shows that if the test function f 
with mean zero satisfies supp f C I , then 

|A(1, 1, /) - A(r(,(,),,(,))D2^„|($ $), /)| < 2'''\\fU^, 

for large k depending only on 7. 
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We now introduce a weaker continuity property on A alluded to before. 

Definition 3.9. We say that a trilinear form A satisfies the weak houndedness property 
if for any interval I and any (p that is L'^ -normalized and adapted to I x I x I of order 

(3.8) |A(0)| <C|/|-^/2_ 

The restricted boundedness conditions with order N' ^ N imply (13.81) of order A^. 
This is immediate for given by a tensor product, while arguments similar to the ones 
used in Lemma 13.51 give the general case. We will prove Theorem 12.71 assuming only 
the weak boundedness property and that the distributions Tj{l, 1) given by (13. 4p (and 
its symmetric versions) are in BMO for j = 1, 2, 3. Since, as we already saw, all these 
conditions are implied by the restricted boundedness, it will follow then yet another 
formulation of the main theorem. 

Theorem 3.10. Let A be a C alder on- Zygmund trilinear form with modulation symme- 



try in the direction of •y. Then A is bounded in the range of exponents of Theorem\2. 
if and only ifTj{l, 1) G BMO for each j = 1, 2, 3, and A satisfies the weak boundedness 
property (13.81) . 

While the Calderon- Zygmund condition does not distinguish between Calderon- 
Zygmund kernels associated with bilinear or trilinear forms, the BMO conditions con- 
stitute a real difference between distributions which are bounded bilinear forms and 
those which are bounded trilinear forms. For trilinear forms, there are three BMO 
conditions, while for bilinear forms there are only two. In the section on paraproducts 
we will see that the three BMO conditions are indeed independent, and we can adjust 
the three functions Tj{l, 1) independently for each j. In particular, we can take two of 
these functions in BMO and a third one not in BMO and construct a distribution that 
provides a bounded bilinear form but not a bounded trilinear one. 

We also observe the following. 

Remark 3.2. Let A be a trilinear Calderon- Zygmund form with modulation symmetry 
in the direction 7 and satisfying (13.81) . Then, the forms Aj, j = 1,2,3, satisfy the 
bilinear weak boundedness property of Remark \2.1\ and the identities 

Ai(l,-) = A(1,1,-) = A2(1,-), 

A2(-,1) = A(-,1,1) = A3(-,1), 

A3(l,-) = A(l,-,l) = Ai(-,l), 
hold when the terms are interpreted as tempered distributions modulo constants. 

It is trivial to check the bilinear weak boundedness conditions for the Aj given the 
one for A. The above identities are also clear at a formal level. To verify them in a 
rigorous way, let / be a test function with mean zero supported in some interval / 
centered at the origin. For C large enough depending just on 7 and for k large enough 
compared to log |/| we have 

Ai(D2.$, /) = A{D^,Dc^, ^2^$, /), 
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because 1 D2k^ ® / = D2kDc^ ® D2k^ / on 7-^. As A; ^ 00 the left hand side 
tends to Ai(l, /) while, by Lemma [3. 8[ the right hand side approaches A(l, 1, /) since 
(g) $ = 1 in a neighborhood of (0,0). A symmetric reasoning gives the other 
identities. 

4. Bump functions estimates 

In this section we prove some estimates on bump functions under the additional 
assumption of Tj{l, 1) = (in the BMO sense) for j = 1, 2, 3. Recall that Tj(l, 1) has 
been defined in Lemma [3171 We will study the action of A on triples of bump functions 
and obtain good estimates in terms of the localization in space and frequency of the 
bump functions. The estimates will be obtained from a reduction to the bilinear form 
case, which we will discuss first. Lemma 14.31 below is a slight generalization of some 
almost orthogonality estimates found in the literature. 

In what follows, we will write diam(Ji, I2) for the diameter diam(Ji U/2) of the union 
of the two intervals. We will also use the fact that if > I/2I then 

^ ^ diam(/i,/2) ^ ^ ^ |c(/i)-c(/2)| _ 

1-^1 1 1-^1 1 

Lemma 4.1. Let (pj and cj be bump functions L^- adapted of order N to the intervals 
J and I. We have 



(4.1) J |0j(x)| \cjix)\dx < (min(-|^, ^ (1 + (max(|J|, | J|)-Miam(/, J)) 



-N 



3/2 

(f>j{x)ci{x) dx 



\I\ 



< ' f^) (l + |JrMiam(/, J))-^ 



while 
(4.2) 

as long as \J\ > \I\ and J c/(x) dx = 0. 

Note that the estimate in (14. ip only needs the size estimates on 0j and c/ but 
not the ones on their derivatives. Similar estimates are obtained when the bumps (pj 
are replaced by the ones obtained by the action of Calderon-Zygmund operators with 
certain cancellation. More precisely one has. 

Lemma 4.2. Let 0j and cj be as in the previous lemma with N > 2. Assume that 
A is a bilinear Calderon-Zygmund form associated with a kernel K with regularity 
parameter 5. Suppose that A satisfies the restricted boundedness conditions (12.61) (or 
equivalently that is bounded on L^ x L"^) and that the linear dual operato^ T has the 
special cancellation condition T(l) = 0. Then for each < 5' < 5, 
(4.3) 

j \n<Pj){x)\ \ci{x)\ dx < (min(^, H)^ (1 + (max(|/|, | J|)-Miam(/, J))-^'+''\ 



Recall that this is A(/i,/2) = / r(/i)/2 
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provided that J (t>j{x) dx = 0, while 

(4.4) I Ti<Pj){x)cj{x) dx < (^^^ ^ ^ (1 + I JrMiam(/, J))-^'+''^ 

if \J\ > l-^l; / 4>j{^) dx = 0, and f C/(x) dx = 0. 



For a proof of Lemma [4.11 and Lemma [4.21 see, e.g., [22] and [13]. We have now the 
following generalization. 

Lemma 4.3. Assume again that A is a bilinear form associated with a Calderon- 
Zygmund kernel K with regularity parameter 5. Suppose that A is hounded and that 
the dual linear operator T has the special cancellation condition T(l) = 0. 

Let \Ii\ > I/2I be intervals and let ip G iS(]R^) be a bump adapted to Ii x I2 of order 
at least 4. Assume ip has mean zero in the second variable, meaning 

1p{Xi,X2) dx2 = 



for all Xi . Then for each < 6' < 6, 

1/2+5' 

(4.5) \Am< 



[jhlJ (l + |/ir'diam(/i,/2)) 



Proof. Let ip{xi,X2) be adapted to Ji x I2 with > I/2I and having mean zero in 
the second variable. Using wavelets, we can expand it in the first variable into a a 
family of bumps (wavelets) {4>j} which are L^- adapted to the dyadic intervals J and 
have mean zero. Furthermore, using the linearity of A and the fact that A extends to 
a continuous functional on L^(]R^) (this being a consequence of the hypothesis and of 
Theorem 12.61) . we obtain 

|AWI<El^(W-'^2),0j)0j(xi))| = J]|A(0X^i)4(x2))|= J]|(T0,|4)|, 
J J J 

where the c/^ has mean zero and is adapted to the interval I2 but with implicit bounds 
depending on the relative sizes of I2 and J that can be estimated using (14. ip and (14.21) . 
We will use (14. 3 p and (14. 4p to estimate {{Tcpjlcf^)]. We proceed as follows. 
We split the sum as 

E= E H- E H- E =Eh-E+E- 

J J:|J|<|/2|<|/l| J:|-f2|<lJl<l/ll J:|/2|<|/l|<|J| ABC 

Using (14.21) and (14. 3 p we estimate 

|J|y^V|J|y^^ diam( J, /a) diam( J, Ji) 



Ejs E [W + + 

A J:|J|<|/2|<|/l' 
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where we have used a discrete version of estimate (14. II) to sum on all intervals J of a 
fixed scale. That creates the normalization factors in the last line. Simplifying, we get 



^ ^ 1^21 - diam(Ji,/2) >^ ^ Ihr^ diam(/i,/2) 





|l/2 




3/2 



A ' ' ' ' 2-'^<|/2| 

which gives a better estimate than the desired one. 

Similarly for J2b^ ^^^^ ( 14.11) and (14.31) and then sum at each scale to get the 
estimate in (14.51) . 

To estimate we use (14.11) and (14. 4p and 6' < 6" < 6 to compute 



C J:|/2|<l/l|<|J| 



f ( \h{\ ^'"^ M, diam( J, /a) diam( J, /i) 



By further splitting the above summation according to the relative size of with 
respect to diam(/i,/2), we get again the desired bound. □ 

Next, we formulate the corresponding estimates for trilinear forms. We will use the 
following notation. 

Definition 4.4. For an interval uj = (a, b) and a constant c > 0, letaj = (ca, cb) (that 
is the dilation of u from the origin, not from its center which we denote by cu). 

Lemma 4.5. Let A be a trilinear C alder on- Zygmund form associated with a kernel K 
with parameter 5, with modulation symmetry in the direction of 7 with ji 7^ 0, and 
which satisfies the weak boundedness condition (13.81) . Assume also that A(l, 1, ■) = 
A(l, -, 1) = A(l, 1, ■) = 0. Let ki = k2 > k^ be three integers. For each 1 < z < 3, let 
uj[ be an interval of length 2~^' , and assume that tUg fl ^cu^ = and cu'^ fl ^0^2 = 0. 

Let li be an interval of length and assume G 5(]R^) is a bump function of 
order 2N adapted to Ii x I2 x I3 and with Fourier transform supported in the box 
cu' := uj[ X X u'^. Then, for each < 6' < 6 we have 
(4.6) 

|A(0)i < (w)'"''' + ( 1 + ' ^-^^^'^'^ 



-N 



where we have written diam(/i, /2, /a) for diam(|J^^j^ Jj) and \I\ = \Ii\ = \I2\ 

Similar statements and estimates holds by symmetry for any permutation of the 
indices 1, 2, 3. 

Proof. We assume without loss of generality that diam(Ji, /2, /a) is comparable to 
diam(J2,/3). This can be achieved by switching the otherwise symmetric roles of Ji 
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and I2 if necessary. We write 

ipix + (32t, X + 13-it) = (f){x + (3it, X + I32t, X + /Sst), 

and note that Ai (■?/') = A(</)). We plan to apply Lemma [4.31 to Ai and if). 

First observe that by hypothesis and Remark 13. 2[ Ai is bounded and satisfies the 
requirements of Lemma H73l Second, we claim the function %Ij is adapted to I2 x J3 with 
constant |/|^^/^M~^, where 

5M= 1 + |5^7ic(/i 
Clearly, it is enough to see this when E7«'^(-^i)l ^ 1- We truncate 

where 

^l{Xi,X2) = tp{Xi, X2)^Ml2{Xl)^Ml3{x2) 

and the functions $m/ are the usual L°°-normalized functions adapted to MI. 

If {xi,X2) := {x + l32t, X + (3^t) is in the support of ipi, then \x + (3it — c(Ji)|/|/| > M 
because 



|7i(x + /5it-c(/i))|> 



J]7,(x + /5,t-c(/,)) 



AM\I\ 



4M|/|. 



By computing derivatives, the claim for ipi follows easily. Let us only indicate the 
bound for the function itself: 

|?/'l(Xi,X2)| = \(t){x + Pit,X + l32t,X + j3^t)\^Ml2{x2)^Mh{X'i) 

< \I\-^'^\l2\-^'^\h\-^I^M-''x 

(1 + \I\-'\X + f52t - C(/2)|)-^(1 + |/rV + - C(/3)|)-^. 

If (x + [32t, X + P^t) is in the support of ip2, then either \x + /32^ — c(/2)|/|/| is of order 
at least M or |x + (3^t — c(/3)|/|/| is of order at least M, and the claim for %l)2 is again 
easy to see. 

Finally, it remains to show that ip has mean zero in the second variable. Consider 
first the particular case when 

A simple change of coordinates gives then 

^Ij{u,v) - 



l{-—U - —v)(l)2{u)(l)3{v) 

7i 7i 



For each u, the Fourier transform of ip in the second variable is supported in cug — ^^^i- 
Since this interval does not contain the origin, the claim follows. 

To obtain the general case expand as a series of products of the above, via windowed 
Fourier series. □ 
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Remark 4.1. Let (7) be the line spanned by 7, and {"y^"^^) be the projection of the line 
(7) onto the plane . We observe that the conditions on uj' in the previous lemma are 
actually equivalent to 

(4.7) u[xJ,n (7(')) = 0, 

(4.8) (7^')) = 0, 

In all applications of Lemma \4.5[ the box uo' will emerge in the following way. We will 
initially have a boxuo := {ui, UJ2, UJ3) which satisfies (14.71) and (14.81) . We will then choose 
some appropriate vector ^7 G (7) and define u' := u — ^7. Regardless of the choice of 

^, (I12D and dMD ^mply for i e {1, 2} that u'^ n ^[ = 0, since ^[ = - -^73 and 

5. Decomposition of A under the special cancellation conditions. 

In this section we will express the form A as a superposition of well localized model 
operators. Before achieving this goal we first recall how the decomposition was per- 
formed in the particular case of the bilinear Hilbert transform, see [TH] . More generally, 
assume K(t) is an x independent Calderon-Zygmund kernel with enough decay on the 
derivatives of K, and consider a A which for each /i, /2, /s G S{M.) has the representa- 
tion 

A(/l,/2,/3)=P.V j fi{x + Mf,{x + P2t)f3{x + P3t)K{t)dxdt. 

One first decomposes K in pieces Kk localized at frequency ~ 2~'^. Then, the modula- 
tion symmetry of A recommends a wave packet decomposition of the three functions, 
adapted to the scale of the multiplier. The geometry of the Fourier plane in conjunc- 
tion with the decay in the derivatives of K allows then one to reduce the boundedness 
of |A(/i, /2, /s)! to that of model operators of the form 

ap \Ip\~^'^\{fi^'Ppi){hAp2){hAp3)l 

pGP 

where P is the collection of all multi-tiles in phase space p = pi xp2 xps, Pi = Ipi^ 
with \Ip\ := = l/pj = l/pgl and Up. pairwise disjoint for each fixed p. The (pp^ are 
wave packets L^- adapted to the time- frequency tile pi and {ap} G i°°{P). 

A fundamental feature of P is that it is a one parameter family of multi-tiles, in that 
each cOp. determines uniquely the other two Up. . Another important aspect about P is 
the so called 'quartile property'; namely the fact that if for two multi-tiles p and p' we 
have Up. fl cUp^ 7^ then we are guaranteed that Up^ fl Up'^ = for each j ^ i. Both of 
these properties follow as a consequence of the fact that the cubes tUp^ x Up^ x Up^ are 
located at some uniform distance from the line (7) and they touch the plane (1, 1, l)"*". 

In the case K depends on both x and t we proceed differently. Consider again a 
Calderon-Zygmund trilinear form A which is modulation invariant in the direction 7. 



MODULATION INVARIANT BILINEAR T(l) THEOREM 



27 



We saw that we can write 

A(/l, /2, /s) = {K{X, ® /2 ® /3(X« + m 

where we still denote hj K a. distribution in 5'(]R^), which agrees with the given 
kernel when t 7^ 0. Note that the Calderon-Zygmund conditions on the kernel do not 
say anything about the distribution K for t = 0. As in the linear case, the weak 
boundedness property and the conditions Tj{l, 1) G BMO are needed to complete, in 
a certain sense, the control on the distribution K. 

To study A we will perform a particular Whitney decomposition on the frequency 
domain. We want to give a heuristic motivation for it. Based on the experience 
with multipliers (both linear and multilinear ones), one expects that a Mihlin-type of 
behavior for the frequency representation of A should play an important role. That 
is, the form should be given in the frequency side by a distribution whose derivatives 
behave like the reciprocal of the distance to a particular singular or bad set. Typically 
one then performs a Whitney decomposition with respect to that set. 

Applying the Fourier inversion formula to each fj we see that , in the sense of distri- 
butions, we can represent A on the frequency domain as 

(5.1) A(/i, /2, h) = {K{-a ■ e, • 01/1 ® /2 ® M)), 

where ^ = (^1,^2,^3)- The conditions on the kernel are too weak to conclude any 
pointwise kind of behavior for K. This will substantially complicate our analysis but, 
intuitively, Z?-*" should be a bad set for A. To further motivate the analysis to be 
performed, consider a very particular case of K with compact support, smooth in x, 
and satisfying for t 7^ the stronger conditions 

\d'^K{x,t)\<\t\-^'+\^\\ 

for all |a| > 0. These conditions (together with the weak boundedness property) imply 

(5.2) \d::^K{u,v)\<\v\-^, 

for f 7^ and m > 1. The estimates say that the derivatives d^^K are only singular 
at the origin (though they still do not say anything about K itself). This and the 
representation (15. ip suggest that A may have some singularities on Z?"*". 

The representation (15.11) is not unique in the sense that it depends on (3. In fact, for 
duality purposes and to exploit the conditions on the operators Tj, we can also write 
by simple changes of variables 

A(/i,/2,/3) = {Ks{x,t)\fi®f2^Uxa + t(3^)) = m^J^M) 
A(/i,/2,/3) = {K2{x,t)\f,®f,®f,{xa + t[3')) = {Uh,h)\f2) 
A(/i,/2,/3) = {K,{x,t)\h®h®h{xa + t(3^)) = (Ti(/2,/3)|/i), 

where the vectors (3^ are still perpendicular to 7 and satisfy that the component /5j of 
them is zero. The Kj are also related to by a change of variable and still satisfy the 



28 A. BENYI, C. DEMETER, A.R. NAHMOD, CM. THIELE, R.H. TORRES, AND R VILLARROYA 



same conditions assumed on K. In other words, 

T3(/i,/2)(a;) = j h{x + pft)f2{x + Plt)Ks{x,t)dt 

and similarly for Ti and T2. 

We obtain then the three frequency representations 

(5.3) A(/i, h, fs) = {K,{-a ■ ^, -(3^ ■ Ol/i ® /a ® /slO)- 
In each of them, away from a plane through 7 defined by 

Pj ■={f3^)^ = span(ej,7), 

the form A is given by a symbol whose derivatives blow up according to (15. 2p when 
^ approaches Pj. We use a Whitney decomposition that simultaneously resolves all 
of the three singular sets Pj independently of which Tj we are using. In a way, the 
conditions Tj(l, 1) G BMO are needed to control the behavior on the bad set 

S:=[jP, 

i=l 

and eliminate the potential singularities of A on it. 

We see from fl5.3p that, at least formally, 1) = translates into 

(^.(-0,0)1^(0)) = 0, 

and hence Kj{u,0) = 0, so A vanishes in some sense on S. We will show rigorously 
that we can perform our analysis in \ S" in Lemma 15.31 below. 

In the case of the bilinear Hilbert transform (or a kernel that is x-independent), the 
formula in (15. ip takes the simpler form 

(5.4) A(/i, /2, h) = {6{a ■ OK{-P ■ Ol/i ® ® M)), 

which clearly vanishes if /i (S> /2 ® /a is supported away from a"*" (which also contains 
(7)). We see then that A is supported on and hence is possibly singular only on the 
line (7) independently of the representation used. A one-parameter family of boxes, i.e. 
cubes, is then used in this case as mentioned above. In the x-dependent case, however, 
a two-parameter family of boxes in will be used to decompose the complement of 
the bad set as we will now describe. 

We first produce a Whitney decomposition of the frequency domain M'^ \ S* into tubes 
as defined below. This is achieved by combining two-dimensional Whitney decompo- 
sitions of each ej- \ {'j^-'^)- In a second stage we perform wave packet decompositions 
of the three functions adapted to such tubes. Finally, information on the decay of 
the coefficients associated with various localized pieces will be provided via the almost 
orthogonality estimates of the previous section, combining the kernel representation 
with the weak boundedness condition and the cancellations Tj(l, 1) = 0. 
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Definition 5.1. A dyadic box is a parallelepiped tui x ti;2 x ^3 such that all intervals 
iOi -called the sides of the box- are dyadic intervals. A tube is a dyadic box where the 
minimal side-length is attained by two of the sides. We also allow the maximal side 
to be all of M. The orientation of a dyadic tube is the direction corresponding to its 
longest side (cubes have no orientation). 

Observe that a nonempty intersection of two dyadic tubes of possibly different ori- 
entations is again a dyadic tube. 

Let Ci ^ 1 be a sufficiently large constant depending on 7, whose value will not 
be specified. Constraints on how large Ci should be will become apparent throughout 
the paper. The other important constant^ that will appear throughout this work are 
C2 := Cl'^ and C3 := Cf. 

For each j G {1,2,3}, we decompose the complement of the plane Pj into the 
collection Vtj of minimal dyadic tubes uo such that Ciuo intersects the plane Pj. An 
important property that will be used repeatedly is that for each such tube we have 
^uj n (7) = 0. These tubes partition the complement of Pj, they are infinitely long 
in direction Cj and their projection onto the orthogonal plane to Cj defines a standard 
Whitney decomposition into squares of ef \ (7*--'-'). 

Now let VL be the collection of all tubes which are nonempty intersections of three 
tubes as above, one in each of fii, and ^3. Then VL partitions the complement of 
the bad set. Each of the tubes uj in this partition, initially defined as the intersection 
of three tubes, is actually determined by the intersection of two of the three tubes, one 
defining the two shorter sides of uj and the other one defining the long side of uj. 

For each Whitney tube c<j G i7 we consider the box Scj, which is the box uj dilated 
about its center by a factor 3. By a standard argument for Whitney decompositions 
(e.g. applied to the Whitney decompositions of each of the three planes) these boxes 
have bounded overlap and two overlapping boxes have comparable side-length in each 
dimension. Therefore we can find a partition of unity of the complement of S 

where each 0^ is supported in 3uj and L°°-adapted to u. 

Definition 5.2. The width of a tube is the length of each of the shorter sides, its length 
is the length of its longest side, and its eccentricity is the ratio between its width and 
length. 

Lemma 5.3. Assume A is a trilinear C alder on- Zygmund form with modulation sym- 
metry in direction 7 that satisfies the weak boundedness property (13. 8p and the special 
cancellation conditions Tj(l, 1) = for 1 <i <?>. 

Let Vt^^'^ be the set of all tubes in Vt with width at least . Then we have 

(5.5) A(^) = lim Y A{iP * 0^) 

fc— >oo ^ 



'From now all occurrences of Ci , C2 and C3 will refer only to this constants 
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for each Schwartz function ip with compactly supported Fourier transform. 

Proof. We will assume without loss of generality that ip is supported into the cube 
[— 1, 1]^. For each k E Tj we construct a set fl^'^^ of tubes of width such that the 
tubes in fi'^'^^ U (l^'^'' form a partition of M^, their dilates by a factor of 3 have bounded 
overlap, and which have the property that if 3a; fl 3a;' 7^ for some cu, uj' G fi^^^ U Vl'^^^ 
then UJ and uj' have comparable side-length in every dimension. To achieve this, let 
f2j be the subset of Vtj consisting of all the tubes of width at least . Define also 
Vtj^k to be the (uniquely determined) collection of tubes infinitely long in the direction 
Cj with width such that VLj^k U fij.fc forms a partition of M?. 

Define now f2*-'^^ as the collection of all tubes that arise by intersecting 3 tubes, one 
in each of Vti^j. U ^i.k-, ^2,k U ^^2,fc and Q^^k U ^^3,^, with at least one of the three tubes in 
some Note that ^l^''^ consists of all tubes that arise by intersecting 3 tubes, one 
in each of ^i^k, ^2,k and ^3^k- It is now an easy exercise to prove that i^^'"'^ U fl^''^ is a 
partition of that has all the desired properties. 

Using these properties and the standard process of partition of unity, we may define 
functions (p^j for uj G (l^''^ that are L°°-adapted to uj and supported in 3a; so that 

Recall that the functions 0^ with uj G fi^^-* have been defined earlier. 

It is not hard to observe (see also the rank properties in section 16. ip that 

(5.6) C^u n (7) ^ 

for each uj G Q^''^ U Q^^l There are only finitely many tubes in 

which intersect the compact support of ip, hence clearly 

(5.7) A(^)- J2 A(V^*0J= J2 ^(^*0-) 

It then suffices to show that the right hand side tends to as A; tends to 00. For the 
rest of the proof it suffices to restrict attention to tubes that intersect the support of 
ip and to sufficiently large values of k. (In particular, we can assume A; > 0) 

We first estimate the contribution coming from the collection Vl^'^'^^ of tubes in (l^'''^ 
whose eccentricity is 1 (the cubes). Due to fl5.6p there are 0(2^^) tubes in To 
estimate the contribution of a tube in to (15.70 we use the modulation symmetry 

of A to get that A{ip * (p^) = A{M^^{ip * (puj)), where ^ G M is chosen in such a way 
that the support of the Fourier transform of M^^{ip * 0^.) lies inside the cube centered 
at the origin with side-length 100C32~'^. This is possible due to (15.61) . It follows now 
easily that the Fourier transform of M^^0^ is L°°- adapted to the cube of sidelength 2~*^ 
centered at the origin. Since k > 0, the same can be said about the Fourier transform 



MODULATION INVARIANT BILINEAR T(l) THEOREM 



31 



of M^^lip * (pu))- This easily implies now that M^^{iIj * (puj) is L^- adapted to the cube 
of sidelength 2^ centered at the origin. Applying inequality (13.81) we obtain that 

|A(M,^(^*0.))| <2-2^ 

Thus, the contribution of Vl^^'^'^ to (EZD is 0(2-'').^ 

It remains to consider the set l^'^'^'^^ of tubes cu G l^*-^^ whose eccentricity 2~'^ is smaller 
than 1. Without loss of generality we can assume that the sides of u have lengths 2"^^, 
2~^ and 2'^^'^ in this order. It follows that u is determined by the intersection of 3 
special tubes u' ■.= M. x ui i x cui 2 G Qi^k C Qi, u" := 072,1 x M x 0^2,2 ^ ^2,k C. Q2 and 
u'" := tus,! X cj3,2 X M e f^3,fe- 

First observe that if such a tube u produces a nonzero contribution to our sum then 
its sides have lengths smaller than 1. This is immediate for the smaller sides whose 
length is 2"^^. Let us now see that the same thing is true for the longer side. We observe 
that u must intersect the cube [—1, 1]^, which implies (072,1 x 072,2) H [—1, 1]^ 7^ 0. But 
since u" GC Q2, we know that ^ 10(a;2,i x 072,2)- This proves |o72,2| < 1- As a 
consequence, we deduce that the Fourier transform of ip * (p^j is L°°- adapted to the 
tube uj. 

Choose now ^ G M such that G (73(07 — ^7) (this is possible due to (15. 6p ). Using 
again the modulation invariance of A and (15. 6p . we get as before that 

|A(^*0J| = |A(M5,(^*0J)|, 

where M^^{iIj * 0^) is L^- adapted to the box centered at the origin with side- lengths 
comparable to {2^, 2^, 2'~'^2^). Moreover, the Fourier transform of the function M^^[ip* 
(j)^) will be supported in the box 07 — 7,^ which is easily seen to satisfy the requirements 
of Lemma 14. 5[ once we prove that 07 satifies the requirements (14.71) and (14.81) in Remark 
14. 1[ But this is immediate since (c72,i x c72,2) n (7^^^) = and {u^^i x ^73^2) n (7*^^^) = 0. 
Lemma 14.51 now gives 

\A{M^^{ij * 0.))| < 2-'=/22-'^(i/2+'5')2-=^^/2+'^/2 

where the factor 2~'^'^/^+'^/2 adjusts the normalization of ip*(j)u- Finally note that for 
each K > there are 0(2^) tubes 07 as above. To see this, we can further assume without 
loss of generality that 07 is determined by 07" and 07"', that is 07 = (073,1, 073,2, 1^2,2) • Note 
first that there are 0(2'') tubes 07"' = 073,1 x 073,2 xM of width 2~'' which intersect [—1, 1]^, 
since Ci (073,1 x 073,2) H (7^^-') 7^ 0. Given 073,1, we know that 072,1 is determined uniquely, 
and then 072,2 is determined within finitely many choices by 072,1, since Ci (072.1 x 072,2) H 
(7(2)) ^ 0. Hence we can estimate 

^gQ(fe,2) K>0 

which is again an acceptable contribution. 

□ 

As we proceed with the second stage of the decomposition of A, we return to viewing 
A as a trilinear form on the triple product of Schwartz spaces, rather than a distribution 
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on M.^. Thus we assume in Lemma 15.51 is an elementary tensor V' = /i ® /2 ® /s 
of three compactly supported smooth functions. To turn into a convergent sum of 
elementary tensors, we invoke Fourier series. 

For each tube G i7 we choose functions 0^^^ for i = 1,2,3, adapted to Ui, 
constant equal to 1 on Sui and supported on 5uJi. The dilated tubes 5uj are still 
disjoint from the bad set 5" since Ci ^ 1. Then 

3 
i=l 

Applying Fourier series on 5uj gives 

3 

0.(0 = E ^-'"i-"^."3 n 0-.te)e'™"«'/(^l-^'\ 

rai,n2,n36Z i=l 

and note that for each M the coefficients decay as 

(1 + max(|rai|, |n2|, l^sD) 

We also note that since 0^^. is L°°- adapted to uji of any order M and supported in 
bui, so is the function M^^/^^^^^cp^^^, with a constant that is 0{nf^). These observations 
imply that 

,ni,n2,n3 A(/i 

* Tni/5|a;i|0.i, /2 * ''"712/51.2 1 r'.2 ) /3 * ''"na/Sl.al'P.a J- 

Another immediate implication is that it suffices to bound 

XI * 'i^^i ' /2 * 0.2. h * 0.3) I 

wen 

uniformly over all functions 0^;, such that 0^. is L°°- adapted of order -say- 2N and 
supported in bui. 

By Shannon's sampling theorem, we can write for each such function 0^. and each / 

u 

where Jj runs through all dyadic intervals of length (IGlciJjl)"^ and M_c[uji)4'iuuji is an 
L^- normalized bump function adapted to Jj of order 2N such that 0/^,.^ is supported 
in ScUj. 

We then estimate A(/i, /2, /a) by 

3 

(5-8) E E |A(07,,^,, 072,<^2, 0/3,^3)1 JJ |(/i,0/^,^J I 

.en/1,/2,/3 j=l 

Next, observe that for each uj E with width 2^^ and eccentricity 2"*^ and each 
5' < (5 

|A(0/, ,.1 ) 0/2 ,.2 ' ,.3 ) I 
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(5.9) < 2-'=/22-''(^'+i/2)(i + 2-'=diam(/i, h, h))-'-'' {I + 2'''\J2l^c{I^)\r''■ 

This will follow from Lemma 14.51 Indeed, reasoning as before, we can find ^ G M 
such that the tube a; — ^7 is contained in some tube centered at the origin with width 
1006*32"^ and eccentricity 2~'^. Since |^7i — c(ci;j)| = OdtUjl) for each i, it follows that 
M_^^(07^^(^-^ ® 0/2,1^2 ® 4'h,ijj-i) is L"^- adapted to the box Ii x I2 x I^. The fact that 
uj satisfies the requirements (14. 7p and (14. 8 p (and actually all the other symmetric 4 
identities) in Remark 14.11 is immediate (see the proof of Lemma [5.3p . 

Using these estimates we shall restructure the sum in (15. 8p and extract the main 
terms. 

First we shall use symmetry to reduce to the case where the sum runs over all tubes 

(5.10) iU = {iUi,iU2,iU3) := (tUi,! X LUi^2 X ^012,2) 

such that 002,2 is the longest side and u is determined by intersection of the tubes 
uj' = uji^i X uji^2 X M G 1^3, cj" = M X uj2,i X 002,2 G fil and 00"' = oo^^i x M x 003^2 G ^2- 

At the expense of replacing the exponent 6 in (15. 9p by a slightly smaller 6' it suffices 
to consider only those tubes 00 with a fixed eccentricity 2~'^ and those triples of intervals 
for which 

(5.11) 2"^-^<diam(/i,/2,/3)/|/|<2"^ 

(5.12) 2"^-^ <diam(/2,/3)/|/| 

for some fixed m and prove summable bounds in k, and m. Here we use again the 
notation |/| := = |/2|. 

Next, we shall use the rapid decay in the last factor in (15.90 to argue similarly 
to above that one only needs to consider those terms for which this factor is large. 
For fixed I3 and I2 we choose an interval Ji satisfying (15. lip and (I5.12p . for which 
^jljc{Ij) is minimal. For any other interval Ji satisfying (15. lip and (I5.12p . we note 

that the function M-c{uji)<Ph,u}i is adapted to Ji of order A^. The constant of adaption 
increases like (l + dist(c(/i), c(/i))/|/|)^, which is offset by the last factor in (15. 9p since 



|7i||c(/i)-c(/i)|< 



0{I). 



Thus it suffices to consider only Ji. We shall write again Ji for Ji and we shall main- 
tain from the above discussion that we are summing over a two parameter family of 
intervals /i, /2, -^3 such that Ji is determined by I2 and I3. Likewise we may assume I2 
is determined by J3 and Ji. Note that we cannot do the same for /s, as I3 is potentially 
much smaller than Ji and I2 and there may be many intervals I3 (about 2'^) which 
maximize the last factor in (15.90 . for given Ji and l2- 

To summarize the above reductions, let fi* denote all tubes as in (I5.10p . which have 
eccentricity 2"*^. Let Tmi^o) denote the set of all triples {Ii, 12,13) associated with uj 
as above, satisfying (15.110 and such that both Ji and I2 are uniquely determined by 
the other two intervals. Then it suffices to get uniform bounds in m and n over all 



34 A. BENYI, C. DEMETER, A.R. NAHMOD, CM. THIELE, R.H. TORRES, AND R VILLARROYA 



functions (pi^^^Ji such that M^c{uj.)(f)j-^i^. is L^- adapted Jj of order A^, for the following 
sum 

(5.13) E iAr^/^2-'^(V2+5')2-(i+^') Hi i, 

where 5' < 5 will be chosen conveniently (see Theorem 15.50 . 

It will be convenient to associate with uj and (Ji, J2, /s) as above 

(1) For each i = 1,2,3 a dyadic interval Up. of length 2'^\u!i\ which contains the 

support 8uJi of the function 0/.,^.. 

(2) For each i = 1,2,3 a dyadic interval cu/j. of length 2^|c(j3| which contains Up-. 
Note that uji^ = Up,^. 

(3) For each i = 1, 2, 3 a dyadic interval Ip. of length IcUpJ^^ which is contained in 

u 

(4) A dyadic interval Ir of length 2^ x 2™"!/!! which contains Ji,/2,/3. 

It is clear that such intervals exist. There is no deep reason we choose to modify Jj to 
Ip- in (3), we only do that so that we have |Jp-||c(jp. | = 1. 

These intervals in general might not be standard dyadic intervals. However, we can 
choose them to be generalized dyadic intervals. We briefly describe this construction 
and refer the reader to [10] for more details. Let g be a large prime and define a 
generalized dyadic interval to be one of the form [2"fc/g, 2*^(^/^ + 1)) for integers k and 
n. If k is restricted to a fixed residue class modulo q, the collection above forms a grid, 
in the sense that all intervals of fixed scale form a partition of the real line and any 
two intervals in the grid are either disjoint or one contains the other one. 

For every interval / , there are q generalized dyadic intervals, which we call covers, of 
length strictly between q\I\ and 251/1 (thus the length is a uniquely determined power 
of two) which contain I. More precisely, each grid except for possibly one of the q grids 
contains such a cover for /. Therefore, for every collection of less than q intervals, there 
is one grid which contains a cover for each interval in the collection. 

Using this method, for q large enough we can choose for each u and Ji,J2,/3 the 
intervals listed above to belong to one of the q grids. We will work with g = 11. Since 
there are only 11 grids, it suffices to consider the sum (15.131) over each grid separately. 
For simplicity of notation we shall only discuss the standard dyadic grid, which is one 
of the 11 grids. Since we only use the grid properties of the dyadic intervals, our proof 
will easily transfer to the case of the other grids. 

The above intervals determine rectangles in the phase plane: i?j = Jj? x ujr- and 
Pi = Ip. X Up-. We will denote by R the 3-tuple of rectangles (i?i, R2, R3) and will call 
it a multi-rectangle. Similarly, p will denote the 3-tuple of tiles (pi,P2,P3), which will 
be referred to clS 8b multi-tile. 

We discuss the important properties of the triples R and p other than the obvious 
containment properties visible in the figure below. First, by a further splitting into 
finitely many collections it suffices to assume that each of Ri, R2, R3 determines the 
other two. As all three rectangles have the same spatial interval Ir, it suffices to show 
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this for the frequency intervals. This in turn follows from the fact that the frequency 
intervals have equal side-length and from the eas}{^ observation that 

(5.14) C^{ujR, X ujR, X ur,) n (7) ^ 0. 
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Similarly, we can assume that determines both and Up^, i ^ j G {1,2}. The 
fact that cUp- determines ujp^ within finitely many choices follows from the fact that 

Ci(cup, xcupjn(7(=^)) ^0, 

a consequence of the special representation oj = uj' Huj". Note also that Up. determines 
ur^, which we have proved to determine ljr^ = ojp^. We must point out however that 
uip^ only determines Up^ within 0(2*^) choices. 

On the spatial side, we observe that the intervals Ip^ and Ip- determine Ip., this 
property being reminiscent of the similar property shared by the intervals This 
implies that in the triple (^1,^2,^3) we can assume that pi,P3 vary freely and determine 
P2 and also that p2 , Ps vary freely and determine pi . We recall for comparison the fact 
that in the case of x independent kernels K, each pi, I G {1, 2, 3} determined uniquely 
the other two (See the discussion in the beginning of this section). 

The area of any rectangle Ri is A = 0(2'*"'"™'). If we incorporate the latest reductions, 
we denote by p the family of all multi-tiles p as above. We will also denote by R the 
family of all multi-rectang le^ R (of fixed area A) associated with triples p G p. For 
any fixed triple R, we denote by p(-R) the set of all multi-tiles p G p that are contained 
in R. 

Finally we notice that for any given i? G R there are at most A'^ multi-tiles p G p(-R). 
On the other hand, any given p G p determines a unique i? G R such that p G p(-R). 
Let us see this latter point. Given p = {pi,P2,P3) we know that ps determines ur^ 



'For similar results, see the rank properties in Section [6.11 

From now, we will stop indexing the dependence of various collections like p and R on k and m. 
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which determines both ujr^ and ujr^. On the other hand, both the length and the 
position of Ir are determined by /p^, since k and m are known a priori. 
With these notations we may rewrite (15.131) as 

(5.15) A-^'Y, Yl i^/?rVpar/'rii(/.0p.)i, 

ReR.pep{R) i=l 

where we can take < 6' < 6 as close to 6 as we want. We will prove bounds for this 
sum that are then summable in k and m. 

For each measurable subset C M with finite measure we define 

XiE) = {/ : I/I < 1e a.e.}, 

X,iE) = {/ : I/I < \E\-'/He a.e.}. 
A major subset of a set E is a subset Eq G E such that \Eo\ > \E\/2. 

Definition 5.4. Let a be an 3-tuple of real numbers and assume aj < 1 for all j G 
{1,2,3}. A 3-sublinear form is called of type a if there is a constant C such that for 
each finite measure tuple E = {Ei, E2, E^) , there is an index jo and a major subset 
Ejg of Ejg such that for all tuples f = (/i,/2,/3) with fj G X{Ej) for all j 7^ jo find 
fjQ G X^Ejo) we have 

3 

iA(/i,/2,/3)i<ni^^-r^- 

i=i 

We will work with A(A, f,, f,) = E^^R Epep(ii) IIrI^'M^' Uli I I- In the 

next section we will prove the following theorem. 

Theorem 5.5. Let jo G {1,2,3}. For each a in the triangular region defined by 

ai + ^2 + as = 1 
1/2 < aj^,aj^ < 1, forji,j2 e {1,2,3}\ jo 
max(— (5, —1/2) < aj^^ < 0, 

we have that A{fi, f2, fs) is of type a with bound C = 0{A^"), for some 6" < S 
depending only on a. 

By choosing then in fl5.15p 6' > 6" we obtain that the original form A is also of 
type a in the same regions. Since the convex hull of the three triangular regions in the 
above theorem is the region characterized by the restrictions 

ai + a2 + as = 1 

max(-5, -1/2) < aj < mm{6 + 1/2, 1), for all j. 
Theorem 12.71 follows then by invoking multilinear interpolation as in [23], |28j . 
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We remark that the case A = 1 in Theorem 15.51 corresponds to the situation when 
the form A is associated with an x independent kernel K{t) (see also the discussion at 
the beginning of this section). 

Finally, to prove Theorem 15.51 we will assume that 

(5.16) R,R' eR and < implies C3A\Ir\ < \Ir:\, 

at the expense of considering log(C3y4) families in scales. 

6. The boundedness of the model sums 

6.1. Rank, trees and sizes. We will prove that if Ci is sufficiently large we have the 
following: 

Rank properties of p 

Let p G p(-R) be a multi-tile that is associated with a tube u as in (15.101) . according to 
the procedure described earlier. Let (^^,^^,^^) G (7). 

(1) (3-ovelapping implies j-lacunary; tile version) If G 2ujp.^ then for each j G 
{1,2} we have ^ C2UJp. and C,^ G C^Aujp^. 

Proof. We will consideiH the case j = 2. Recall that u C uj" = x 002,1 x 
'^2,2- If € C2UJp^, then this together with G 2ujp._^ would force ^{id2,i x 
^^2,2) n (7''^^) 7^ 0) contradicting the construction of u". On the other hand, by 
construction it follows that there is some (^^, ^^) G (7'-"'^^) nCi (072,1 x 1^2,2)- Note 
that — ^^1 < Ci\ujp,^\. This implies — < y^q^sI'^psI- This together with 
the fact that G Cia;2,i, implies G (73^72,1. Finally, this together with the 
fact that LJ2,i C Aup^ implies ^'^ G CsAup^. □ 

(2) (j-ovelapping implies 3-lacunary) If ^-^ G 2ujp^ for some j G {1,2} then we have 
^3 ^ C2Wp3 and ^3 G Csujp^. 

Proof. A similar argument as for (1) applies here. □ 

(3) (j-ovelapping implies i-lacunary) If ^-^ G 2ujp^ for some j G {1,2} then for 
i = 3 — j we have ^ C2i^pi and G C^Up.. 

Proof If G C2Cc;p^ then this together with G 2ujp^ forces (^\^^) G (7(^)) fl 
C2(^^Pi X ^^^2); contradicting the construction of u'. On the other hand, from the 
construction of u' we know that there is some {$,^,^'^) G (7^^-') H Ci{ujp-^ x cUpJ. 
Since |^-^ — ^-^1 < Cijc^pJ and G CiUp-, it follows that G CsUJp-. □ 

(4) (3-ovelapping implies j-lacunary; rectangle version) If ^'^ G 2ujfj,^ then for each 
j e {1, 2} we have ^ C2u;ij^ and G CsUr^. 

Proof. Similar arguments apply here and for the rest of the properties. □ 



'For j — 1 one has to use the fact that uj C oj'" :~ ((jJ3,i, M, a;3,2)- 
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(5) (j-overlapping implies 3-lacunary; rectangle version) If j G {1, 2} and G 2ujr^, 
then ^ C2UJji^ and G Cscu^g. 

(6) (j-overlapping implies i-C^ overlapping; rectangle version) If i,j G {1, 2, 3} and 

G 2cjH^, then C ^ C^ur^. 

Definition 6.1. Let i G {1,2,3}. An i-tree (T,Pt) with top (/t,^x)' where Jx is 
a dyadic interval, is a collection T C R o/ multi-rectangles R such that Ir C Jrp, 
together with a collection px = IJ/?gt p(-^) T) of multi-tiles satisfying p{R, T) C p(-R) 
and ^x ^ 2ci;p. /or eac/i i? G T and each p G p(-R, T). 

A 0-tree (T, px) with top (/x,^x)' where Jx a dyadic interval, is a collection 
T C R o/ multi-rectangles R such that Ir C Jx, together with a collection px = 
Urgt p(-^' T) o/ multi-tiles satisfying p{R, T) C p(-R) and either 

(1) G 2a'ij^ for each R E T and ^ 2u;p^, ^ 2co'p2 for each p G p(-R, T), where 
(Ct'^t) ^ (t*'^'') (this will be referred to as 0^-tree) 

or 

(2) G 2u!r^ for each R E T and ^x ^ '^^pi, ^ '^^P2 /^'^ ^^^/i p G p(-R, T), where 
(^T'^t) ^ (7*'^'') (this will be referred to as O'^-tree). 

Given a dyadic interval I and a point ^ = (^^,^^,^^) G (7), define the saturation 
S{I,^) of the pair (/, ^) to the set of all multi-tiles which lie in the union of the maximal 
1-tree with top (/, ^^), the maximal 2-tree with top (/, ^^), the maximal 3-tree with top 
(/,^^), the maximal 0^-tree with top (/, ^^) and the maximal 0^-tree with top (/, ^^). 
Note that actually 

3 

(6.1) 5(/,a=u u p(^)- 

We state three easy lemmas for future reference. 
Lemma 6.2. If G 2ljr^ for some I G {1,2,3} and IrC I then p(i?) C S{I,C)- 
Lemma 6.3. If G 2ujr., Ir/ C / and ujr. C ur', for some i,j G {1,2,3} then 
p(i?')c5(/,0. 

Proof. We know from rank property (6) that G C^ujr^, and from fl5.16p we deduce 
that G 2ujRr . The conclusion now follows from the previous lemma. □ 

The following is an immediate consequence of the rank properties. 

Lemma 6.4. If i G {0, 1, 2, 3} \ {j}, (T, px) is an i-tree with top (Jx, ^x) ^'^^ P ^ Pt 
thei^ ^ 2ujp. and ^ C^Aup. . 

'^Here is the j'*'' coordinate of the vector G (7) which is uniquely determined by the coordinate 
^^p. If i 7^ 0, then is the i*'* coordinate of this vector, while if i = 0, it is either the first or the 
second, depending on whether (T,pt) is a 0^-tree or a 0^-tree. 
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Remark 6.1. Lemma [gi^] ^^^^ f l5.16p imply that for each i G {0, 1,2,3} \ {j}, each 



i-tree (T,pt); each R, R' e T with < {Ir^I and each p G p(i?, T), p' e p(i?',T) 
we have Up^ fl Up', = 0. 

For each subcoUection p*(-R) C p{R) and each / G {1, 2, 3} we will denote P;*(-R) := 
{pi : p G p*{R)}. For the simplicity of notation we will sometimes write T instead of 
(T,Pt). 

Definition 6.5. Fori G {1,2,3}, the tile size size/^^ of a collection^* C p of multi-tiles 
with respect to a function fi is defined as 

|2\ 1/2 



sizez,i(_pj := sup i , 

ReR \ Mfll 

w;/iere p*{R) := p(i?) np*0 

Definition 6.6. For z 7^ j G {1, 2, 3} and for j G {1, 2} anc? i = 0, the tree size sizej^j 
0/ a collection p* C p 0/ multi-tiles with respect to a functioi^ fj is defined as 

, *N /^Si?,eT Zlp,ep,(R,T) I (/i' <^Pj) '^^ 
sizej-i(p ) := sup ' , 

T \ MtI 

where the supremum is taken over all i-trees (T,pt) with px C p*. 

We will estimate the model operator associated with each tree (T, px) by first sum- 
ming in the third variable and then applying the Cauchy-Schwartz inequality in the 
first two variables. In doing so we recall that Pi and ps determine pj uniquely 

3 

ReT pep{R,T) 1=1 

<Ei^«i"'( E i(/i><^p.)n^/'( E i(/2>0P.)n'/^ E i^p^r/^K/s, 0.3)1 

ReT piepi(i?,T) p2ep2(i?,T) p3Gp3(R,T) 

ReT piepi(i?,,T) P2ep2(R,T) P36P3(R,T) 

This estimate is refined as follows, depending on the type of tree we are dealing with. 
For a 3-tree (T, px) we write 



J^R^T SpiGpi(_R,T) I (/l' ^Pi) P^ [ X]/?gT Sp2Gp2(-R,T) I (/z, 0p2) '^^ 



At(/i,/2,/3)<|/t| 



a given tile pi may correspond to more multi-tiles p in p(i?.) or in p*, it will be counted 
only once in each summation. 

^^The size will not be indexed by fj, the function with respect to which the size is measured will 
always be clear from the context. 
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( Z]p3ep3(R,T) I (/S) ^pa) '^^ 
X sup — - — — -T— 

\-ReT \Ir\ 

< |/T|sizei,3(pT)size2,3(PT)size3,3(pT). 
An identical estimate shows that if (T, px) is a 0-tree then 

At(/i,/2,/3) < |/T|sizei,o(pT)size2,o(PT)size3,3(pT). 
For an z-tree with i G {1,2} we write 

1/2 /.^ \ i£ 1 \ i2\ 



At(/i,/2,/3) < |/t| f ^^^^^'^-^^yj^^'^^^''^ sup |^ ^P^^P^(^.T)I(/»'^P.) 

Si?,eT Sp3gp3(i?,,T) I (/3,0P3) 



l-^Tl / i?.eT \ \J-R\ 

1/2 



I/' 



T 



< I It I sizej- i (pT)sizei,i (pT)size3,i (px) . 
We thus see that if a collection p* of multi-tiles is organized as a disjoint union 
of trees, p* = U(T,PT)e.F Pt, then 

3 3 

(6.2) |Ap.(/i,/2,/3)| < (5^n^i^^^-.^(P*))(E I^tI), 

i=o j=i Ter 

where for the purpose of keeping the notation symmetric we denote size3,o '■= size3_3. 

This inequality sets up the strategy for the following sections, where we will split p* 
into collections which can be organized into trees, with good control over both their 
sizes and over the norm of the counting function of the tops of the trees. 

6.2. Bessel type inequalities. For each j G {1, 2, 3}, each i? G R and each p*(-R) C 
p(-R) we will use the notatioiJll 

Pjep*(^f) 

If R belongs to a tree (T, pt) and p*(-R) = p(-R, T), then the notation rp(/), S-'^r^{f) 
will be preferred. 

An immediate consequence of Lemma 14.11 is the fact that 

(6.3) Iin.p.(.?,)(/)ll2-5^^,p*(^0(/)^ll/ll2 



"'^^When no confusion can arise, we will suppress the dependence on p*(i?) of r^(/) and 5;^(/). 



MODULATION INVARIANT BILINEAR T(l) THEOREM 41 

for each / G L^, each R and each p*(-R) C p{R). Similarly, due to Remark 16. ![ for 
each i e {0, 1, 2, 3}, each j G {1, 2, 3} \ {i} and each z-tree (T, px) 

(6.4) Ell^iT(/)l|2^E^iT(/)<ll/l|2- 

Definition 6.7. (M-separated rectangles) We say that a family R* of rectangles R = 
Iji X Ljji is M-separated if for any R,R' G R*, R ^ R' , we have that {MI^ x lor) fl 

{mIr> X oj^jO = 0- 

Lemma 6.8. Let R* be a finite family of pairwise disjoint rectangles. Then, for each 
M > 1 we can find a subfamily R C R* that is M-separated and satisfies 

\Ir\<^mJ2\Ir\- 

Proof. Fix M > 1 and define Rm = MIr x uor. We select recursively rectangles 
i? G R* with maximal and with the property that -Rm H R'^ = for all previously 
selected rectangles R'. When this procedure ends, we get a family of rectangles R = 
{RK.., i?"} C R*. We now define 

R* = {i? G R* : Rm r}R\j^^ , Rm n = for A; < i}. 

Then it is clear that R is M-separated and that 

n 

R* = |Jr\ 

i=l 

thus 

E = t E i'hI 

_ReR* 1=1 _ReR» 

Moreover, < \lRi \ for any i? G R*. Otherwise, if > \lRi\, by the maximality 
condition of length and the fact that Rm H R\^ = for all k < i, R should have been 
chosen instead i?*. This together with the observation that MIr fl Ml^i ^ for each 
i? G R' implies that Ir C 3M%. Thus, 

(6.5) U IrCSMIr.. 

R&R' 

On the other hand for each R, R' G R* we know that uiRi C Ur fl ujr/ ^ 0, which 
together with i? fl i?' = implies that Ir fl Ir' = 0. This together with (16.51) implies 
that 

n 

\Ir\<3MY\Ir.\=3MJ2\Ir\. 
ReK* i=i R(z^ 

□ 

Definition 6.9. Let j G {1,2,3}. 

(1) We say that two trees (T, pt) and (T', pt') are disjoint «/T n T' = 0. 
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(2) We say that two trees (T,pt) and (T',pt') with tops {It,C,t) and {It',C,t') are 
j -strongly disjoint if they are disjoint and satisfy the following: 
ifpe p{R, T), p' e p{R', T), Re T, R' e T, C Up'^, then Ir, n It = 

A family of trees is said to consist of j -strongly disjoint trees if any two trees in the 
family are j- strongly disjoint. 

The following lemma will be the main tool in dealing with the tile sizes. 

Lemma 6.10. Let j E {1, 2, 3}, f E L"^ and A > 0. Let H* C H be a family of multi- 
rectangles, each of which is associated with a collection of multi-tiles p*{R) C p(-R). 
Assume that the rectangles (-Rj^rgr* are pairwise disjoint. 
Assume also that for each R G R* we have 

(6-6) ^^,p.(^)(/)>A|/Hr/^. 
Then 

HeR* 

Proof. To simplify notation we will drop the j dependence of the various operators and 
will index them only by R. Since the rectangles Rj are pairwise disjoint, by Lemma 
16.81 it suffices to assume that they are A'^-separated with e = and to prove that 

ReR* 

We assume that ||/||2 = 1. We may also assume that for each R eH* 
(6.7) X\lR\'/'<Sn{f)<2X\In\'/'. 

To see this latter assumption, one can split R* into subcollections R* = IJa:>o ^^'^^ 
that for each i? e R^ we have 2''X\Ir\^/^ < Snif) < 2''+^X\Ir\^/^ . In other words, each 
subcollection satisfies fl6.7p with A replaced by 2'^A, and applying the estimate to each 
subcollection and summing a geometric series will prove the general form of the lemma. 
The proof that follows is a classical instance of the TT* argument. 
We have 

E \\TRif)\\l = 

rgR* ReR* 

= J2iTRT*M,f) 

ReR* 

< II E TRT*n{f)\l 

ReR* 
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which due to (16.31) imphes 

< E \\TR'im2\mTnh^2\\TR{f)h. 
Using this, hypothesis (16.71) and again (16. 3p we get 

ReR* R,R'eR* 

By symmetry and the fact that T^iTr = if ur. fl ojr;, = we have 

2 



Em 



< 



RgR* 



(6.8) A-2( |/i?J||r*T^||2^2+ E \lR\"^\lR'\"^\\TR'TRh^2] 

ReR* ReR* ii'GJffrcq(R) 

where J^freq(-R) = {R' e R* : R' R, \Ir'\ < \Ir\,ujr. n ujR>^ ^ 0}. It suffices to prove 
now that the term in (16.81) is 0(A~^ Srgr* I-^rD- '^'^ achieve this, we will first estimate 
the operator norms ||T^/Tr||2^2- 

The diagonal term is immediately seen to be 0(A"^ J^ReR* \^r\)^ (16. Sp . since 

||^A^r||2-^2 < II^-r||2-*2 ~ 1- To estimate the off diagonal, we first note that 

\{T*Mf),9)\ = \{TRif),TRig))\ 
Pj^p*{R)p'j&p-{R') 

(6.9) <A^ max \{<Pp^,<Pp')\ WfhWgh- 

P.-6p*(i?) 

p;.ep*(H') 

From Lemma 14.11 we know the bound 

(6.10) < (|^)'^'(l + |/p,r^|c(/p,)-c(/,,)|)-"'. 

If R' e J'freq(-R), it foUows that A^Ir n A^Ir' = since R, R' are A^ separated. This 
implies that 

(6.11) 1 + |JpJ-i|c(JpJ - c(V)| > max{l + \lR\-y{lR) - c(JrOU72}. 
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We use this and f l6.10p to argue that 

\{<Pv,Av')\ < {j^y\l + \lR\-MlR)-c{lH')\)-'A<'-''^ 

Thus 

(6.12) n'TRh^2 < ^2-^(^-2) (t^I'^V + l^«r'|c(//?) - c{In')\)-' 

We use this inequahty to bound the sum corresponding to the off diagonal term in 

del]) by 

(6.13) J2 E \lR'\i^ + \lR\-MlR)-c{lR')\)-'. 

Ren* iJ'e.Fi,eq(fl) 

Now we fix G R*. We note that if R' ^ R" and R' , R" G J^frcq(i?) then Ir, ^Ir„ = 0. 
With these observations we may write 



E \lR'\i^ + \lRnc{lR)-c{In')\)-'< E / Xi.^ 



< 



<\Ir\ 



Finally, we get 

( E i^^O' ^ E \ir\ + a'^'''-''' E i^^i) ^ E 1^ 

ReR* Ren* ReR* Ren* 



R\ 



□ 



Definition 6.11. (M-separated tiles in a tree) We say that a tree (T, pt) with top 
(/t,^t) is M-separated if MIr C Jx for each ReT. 

Lemma 6.12. Let i G {0,1,2^} \ {j}, M > 1, / G and A > 0. Let T be a 

collection of j -strongly disjoin!^ i-trees (T, pt) satisfying for each T G and each 

ReT 

(6.14) syif) < 10-Hr'/' X\Ir\'/' 

(6.15) AVT|/4<E^iT(/)'<AVT|. 

ReT 

Assume also that for each subtree T' C T G with top (/t',^t) we have 

(6.16) E^iT(/)'<AVT'|. 

ReT' 



"'^■^The lemma can be formulated without involving strongly disjointness in either the hypothesis or 
the conclusion; we choose this formulation since this is how the lemma will be applied. 
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Then for each T E J-" we can find a collection T C T such that the trees {(T,prj, := 
IJ^gj. p(i?, T)) : T G J-'} (understood as having the same tops as the original trees) 
with p{R, T) = p(-R, T) for each R G T, are M-separated, j-strongly disjoint and they 
satisfy 



Proof. To get the new trees we proceed by removing those tiles in a tree that are either 
too close to the edges of the tree top-tile or too close in scale to it. More precisely, for 
any i-tree T G we let /(T) = ^/t and we define the subcoUections 

Ti = {i?GT:/R</(T)} 

T2 = {ReT:In>I{T)} 

T3 = {i?GT:|/R|>(100M)-VT|} 
where //? < /(T) means sup{a; : x G Ir} < infja; : a; G /(T)}. Then, we define 

t = T\(TiUT2UT3). 
With these definitions we have by (16.161) 

ReTi 

and the same for T2, while by f l6.14p 



It easily follows that each tree T is M-separated. The fact that the new collection of 
trees is j-strongly disjoint is inherited from the initial collection. Finally, 

_Rgt ^ST i=l,2,3_ReTi 

1 2 2 2 1 2 

^ — A It — — A It ^ — A It . 

- 4 ' ^' 10 ' ^' - 20 ' ' 

□ 

Definition 6.13. (Rectangles M-separated in scales) We say that a collection R* of 
multi-rectangles is M-separated in scales if R R' & H* and \Ifj\ = \Ir>\ imply that 
dist(c(/K),c(/RO)>M|/«|. 

We are now ready to prove the analog of Lemma 16.101 for the case when the trees 
may consist of more than just one multi-rectangle. This lemma will be the main tool 
in dealing with the tree sizes. 
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Lemma 6.14. Let i e {0,1,2,3} \ {j}, M > 1, / e and A > 0. Let R* he a 

collection of multi-rectangles which is M -separated in scales. Assume we also have a 
collection T of j- strongly disjoint i-trees (T, px) with [Jrj.g^T = R* satisfying for each 
T eJ^ and each ReT 

(6.17) Syif) < 10-'M-'/'X\In\'/' 

(6.18) AVt|/4< 5^5^,t(/)'<AVt|. 

Assume also that for each subtree T' C T G with top (/t'j^t) we have 

(6.19) E^«,t(/)'<A^|/t'|. 

Then we have 

$:i^Ti<(i+AV-^)A-^ii/iii 

Proof. We will drop the j and T dependence of various operators. By normalizing we 
can assume that ||/||2 = 1- 

Moreover, by Lemma [6. 121 we can assume the trees in JF are M-separated, with some 
loss in the constants from (16.181) and (16.191) . Indeed, the new trees fabricated by the 
procedure in Lemma 16.121 have the same tops as the old ones so that the quantity 
^Tej^ |/t| does not change. 

The proof is another TT* argument, that follows along the lines of the one in Lemma 
16.101 Due to (16.181) we get as before 

(6.20) (a^EI^t|)'< J2 r^'(/)ll2im'riill2^2||T^(/)||2 

Te.F R.R'eR* 
\'r'\ = \Ir\ 

(6.21) + Yl \\TR'm2n'TRh^2\\TRif)h. 

R.R'eR* 

IIriKIIrI 

We estimate each term separately. By Cauchy-Schwartz, the first sum can be 
bounded by 

(6.22) 1 Y (IIW)i+rR'(/)ll^)l|T^T^'ll2^2. 

\Ir'\=\Ir\ 
-' J 

By using (16.91) and the fact that whenever p G p(-R), p' G p(-R') we have 

|/pj-'|c(/p,) -c(/p;)| > llnrMlR) - cilR')\. 
we can derive similar to (16.121) 

\\n'TRh-^2 < A'il + IIrI-MIr) - cilR'W, 
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whenever \Ir'\ = \Ir\. By using this inequahty in the case R' ^ R and the fact that 
1 1 T^Tr 1 1 2_»2 ^ I5 the term in (I6.22p can further be bounded by 

(6.23) EllW)ll2(l+ E A'{l + \Innc{lR)-c{lR'W). 

Finally, using the M- separatedness in scales, the term in (16.231) (and thus the term 
in (16.201) ) is bounded by 

\\TR{f)\\l{l + A^M-^) < \\l + A^M-^) |/t|. 

We will next concentrate on proving similar bounds for the term in (I6.2ip . We start 
with a few observations. Fix R and T such that R eT and denote 

W{R) = {R' e W : |/^,| < |/^|, T*,Tr ^ 0}. 

The first point we make is that if R' G R*(-R) then R' ^ T. This is an immediate 
consequence of Remark 16.11 Using this and the j-strongly disjointness we see that 
R' e K*{R) implies Ir> n /t = 0. 

Secondly, we observe that R' 7^ R" E R*(-R) implies that Iri fl Irh = 0. To prove 
this, we first note that the definition of R*(-R) guarantees the existence of T', T" G J-", 
pW^pC^) e p{R,T) and e p{R',T), p(^) G p(i?",T") such that u w C u o) 

and u; (2) C (4). Due to (15.16P and due to the grid structure it follows that ujr, C 

u (3) flu; (4). In particular, 00 (3) flu; (4) 7^ 0. We distinguish two cases. If {00 (s) \ = \uj (4)| 

Pj Pj Pj Pj Pj Pj 

then clearly Ir/HIr" = 0, since otherwise we would get R' = R". If \u (3) | < \uj (4) | then 

Pj Pj 

we first argue as above using Remark 16.11 that T' 7^ T", and then using the j-strongly 
disjointness that Ir" fl Jt' = 0. We conclude again that Ir' fl Ir" = 0. 

We next estimate ||T^/Tr||2^2 for R' G R*(i?). Note that due to the first observation 
above and the M-separatedness of the trees we have MIr fl Ir' = 0. By estimating 
like in (16.90 and (16.110 we then get 

\\Tr'Tr\\2^2 < + iirI-mir) - ciiR')\r'' 

< + IIrI'MIr) - c{Ir')\-\ 

We use (I6.17P to estimate 

\\TR'U)h<\\SR'U)h<M-'''\\lR,\'l' 
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and similarly for R. If we take into account all these observations, we can bound the 
term in fl6.2ip by 

AV-^X'J2I1 E \lR'\{l + \lRnc{lR)-c{In')\r 




< A'M'-^'x' Yl E + i^KrMist(c(/^), i^))-' 

Te:F ReT 

□ 

6.3. The Peeling Lemma. The following lemma will provide a decomposition of the 
multi-tiles in collections with good control over various sizes. 

Lemma 6.15. Let i G {0,1,2,3}, j G {1,2,3} and f G L^. Let p* C p be a finite 
collection of multi-tiles and let R* C R 6e a finite collection of multi-rectangles such 
that each p G p* belongs to some i? G R*. We assume that the collection R* of 

2 

multi-rectangles is M -separated into scales with M = A^-^ . We also assume that 
sizej,j(p*) < A. Then there is a collection JF of trees (T, px) with pt C p* such that 

Ei^t|<ai^a-^II/II^ 

sizej-i(^p* - y Pt) < A/2. 

Proof. We first describe the proof in the case we deal with tree sizes {i ^ j), which 
practically also contains the proof for the much simpler case of tile sizes. This will be 
briefly mentioned in the end of the proof. 

First we eliminate all the multi-rectangles which have big sizcj^j. We do that in 
order to set the stage for an application of Lemma 16.141 Consider the collection JFq of 
all singleton i-trees {R,p*{R,^)) with top (Ir,^), for some R G R*, C ^ some 
p*{R,0 ^ P*(^)> such that 

^^.P-(K,,)(/) > 10-^M-V2A|/^|V2. 

Note that a given R may appear in more than just one pair {R, p* {R, C,)) ■ Define J-'qq 
to be the collection of all R that contribute to JFq, that is {R, p*{R, Cr)) ^ for some 
CI 
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Start with T\ := and perform the following algorithm. Select some R G JFqo, with 
the additional property that 1^ is maximal with respect to inclusion, among all such 
R. It does not matter which one is selected, if there is more than one R that qualifies 
to be selected. Set T\ := T\ U {-R}. 

Define the vector ^/j G (7) as the one uniquely determined by the coordinate ^^r. if 
i 7^ 0, then is the i^^ coordinate of this vector, while if z = 0, it is either the first or 
the second coordinate, depending on whether (i?, p*(i?, ^)) is a 0^-tree or a 0^-tree. 

Define Tr := {R! G Tqq : -R^- fl Rj 7^ 0}. Set JFqo := JFqo \ Tr and restart the 
algorithm. 

Due to the maximality of Ir it follows that C Ij^ for each R G Tr. Let us observe 
next that 

This follows from Lemma 16.21 if R! = R, and from Lemma 16.31 if R' 7^ R. From fl6.ip 
we deduce that IJ^,^^^ p*(i?') can be split into the union of five trees, each of which 
with top interval Ir. 
It suffices to prove that 

J2 \lR\<A^^MX-^f\\l. 

But this follows immediately from Lemma I6.10[ since the rectangles {Rj)R^r^f^ are 
pairwise disjoint. This ends the first stage of the construction. 

In the second stage of the construction we perform the following algorithm 
Step 0: Initialize po := p* \ Uije^i S{IrXr). ^^'^ = 0, S^l = 0, J^j^ = 0, = 0. 
Step 1: Select an z-tree (T, px) with top (/t,^x); T C R* and px C po such that the 
following requirements are satisfied 

(i) (T, pt) is the maximal tree with the given top (/t,^t) ^^^^^ '^^^ t)e constructed 
out of the multi-tiles that are available. 

(ii) Eij.T^iT(/)'>iAVT| 

(iii) < Up. for eaclo p G px 

(iv) is maximal over all the trees that satisfy (i), (ii) and (iii) above. 
If no such tree can be found then go to Step 5. 

Step 2: Put the tree (T, px) in the collection J^^^^ and the multi-tiles S{It, ^t) in s2jL 
Step 3: Upgrade po := Po \ <S{It,^t) 
Step 4: Go to Step 1 

Step 5: Select an i-tree (T, px) with top (/x,^x)) T C R* and px C po such that the 
following requirements are satisfied 

(i) (T, Px) is the maximal tree with the given top (/x,'Cx) ^^^^ can be constructed 
out of the multi-tiles that are available. 

(ii) E,j.t^1t(/)'>|AVt| 



Recall that G (7) is the vector uniquely determined by the coordinate as in Stage 1. 
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(iii) C,^ > Up. for each p G px 

(iv) is minimal over all the trees that satisfy (i), (ii) and (iii) above. 
If no such tree can be found then go to Step 9. 



(2^1 ~* ('2') 

Put the tree (T, px) in the collection JF^ and the multi-tiles S{It, Ct) in 5aux 

Upgrade po := Po \ ^aux 
Go to Step 5 

Stop. The algorithm is over. 
The first part of Lemma 16.41 easily implies that if po denotes the value after the 
algorithm above ends, then sizej_j(po) < A/2. It suffices now to prove that 



Step 6 
Step 7 
Step 8 
Step 9 



E 



This will follow from Lemma 16.141 once we prove that both JF-*-^-* and J-'-'^^ consist of 

j-strongly disjoint trees. It suffices to prove this for J-'^^^ . 

We verify the second requirement of j-strongly disjointness. Assume for contradiction 
that there are two distinct trees (T,pt), (T',pt') G J^^'^^ and R E T, R' e T, p e 
p{R, T), p' e p{R', T') with Up. C Up/, and Ir' C /rp. By using Lemma fl5.16p and 

(iii) in Step 1 of the construction of both (T, px) and (T', pt), we get that > 
Thus, by (iv) in Step 1 we know (T, pt) was selected before (T',pt')- 

On the other hand, by the grid properties we know that uj^. C uj^r- We can then 

invoke Lemma [6.31 with / := Jx to conclude that p' G '5(/t,^t)- But then it is clear 
that p' was eliminated before the selection of the tree T', giving rise to a contradiction. 

We now verify the first requirement in the definition of j-strongly disjointness. As- 
sume by contradiction that i? G TflT' for some T, T' G J^i^\ and assume without loss 
of generality that T was selected before T'. But then, by Lemma 16.21 with / = Jx it 
follows that p(-R) C iS(/t,^t)- This means the whole p(-R) was eliminated before the 
selection of the tree T', giving rise to a contradiction. 

Note that the first stage of this proof is essentially what needs to be done when 
dealing with tile sizes {i = j). Precisely, at each step of the selection algorithm we 
search for singleton j-trees (i?, p*(i?, ^)j)) with top (J^, ^)j) such that 

and such that Ir is maximal with respect to inclusion. We eliminate the multi-tiles 
S{Ir,^r) from p*. Let JFqq be the collection of the selected multi-rectangles R. The 
fact that 

is an immediate consequence of Lemma [6. 101 and of the fact that the rectangles Ri with 
R G J-'oo are pairwise disjoint. 

As a final observation, we note that -due to fl6.ip - at each stage in the Peeling Lemma 
we eliminate with each multi-tile p G p(-R) all the multi-tiles in p(-R). □ 



MODULATION INVARIANT BILINEAR T(l) THEOREM 



51 



6.4. Size estimates. In this section we will see how to estimate various sizes. Before 
we do so, we recall two lemmata that will be used in the sequel with the words 'quar- 
tiles' or 'tri-tiles' replaced by 'multi-tiles'. 

Lemma A (Lemma 4.2 in [25j pg. 410) 

Let P be a finite collection of quartiles, j = 1, 2, 3 and let {ap^}p^p be a sequence of 
complex numbers. Then 



where T ranges over all trees in P which are i-trees for some i ^ j ■ 



Lemma B (Lemma 6.8 in [26j pg. 443) 

Let j = 1, 2, 3, Ej be a set of finite measure, fj be a function in X{Ej), and let P be 
a finite collection of tri-tiles. Then we have 



tpp Ut 1 



for all M with implicit constant depending on M. 

We can now state our lemmas. 
Lemma 6.16. Let f G X{E) and 1 < p < oo. For each i? G R, G M and I we have 

( 



For each % G {0, 1, 2, 3} \ {j} and each i-tree (T, pt) with top (It, ^x) have 

1 



Zli?.eT Z]pj6Pj(R,T) I (/' <^Pj) PA „ ^ 111 A^-ll 



(6.24) I — ---- I <C^sup^||l^x^ up, 

where 



= {/ dyadic: J/j C / C It for some ReT}, 
Ca = 0(^1/2) ifi G {0,3} and] G {1,2}, and Ca = 0(1) if i G {1,2}. 

Proof. This is a version of Lemma B above. Here we prove only the cases that are a bit 
different from the case in Lemma B. Namely we will only prove the second part of our 
lemma, and only in the case when i = 3 and (z = 0,j G {1,2}). These are the "worst 
case scenarios" because of the presence of A in the corresponding type of lacunarity. 
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Define a^^ = (X]pjGpj(_R t) I {fi4'pj) P)^^^- We first focus on the case z = 3. We note 
that 

aV<A sup \{fApj)? 
pep(i?,T) 

= A sup K/* 
pep(/?,T) 

<A\Ia\ inf Ml{f*l ){x). 
By Lemma A above, it sufficefl to show that for each subtree T' of T 

(6.25) \\(T.-%fr) ' ii^^^'^'iii^^t'ii- 

\i?eT' \ R\ J 

To simphfy notation, we will continue to write T rather than T'. and by writing 
/ = /i + /2 with /i = /I2/T5 it further suffices to prove fl6.25p for both /i and f2- 
In the case of /2 we use the decay of (pp. to write 



T| 



1 



T' 



which by summation and Holder's inequality proves (16.250 . 

To deal with /i we apply the Fefferman- Stein inequality first and then invoke the 
Littlewood-Paley theory and the rank property (4) to get 

II (T.^%fr) ' iip^^^/'ii(E^^i(/i*K)(^))'^'ii^ 

ReT 
< 41/2111 II 

^ A \\i-E\\p, 

where Mi{f) denotes the Hardy-Littlewood maximal function of /. 

Let us now briefiy see the case i = 0. The argument is very similar to above. Denote 
by the j*'' component of the vector C,t associated with as before. By modulation 
symmetry it suffices to assume that = 0. For each i? G T and each p G p(-R, T) let 
a;(p) be an interval of the form [e2'^,e2^+^] (e G { — 1, 1}, A; G Z) such that C uj{p). 
This is possible due to Lemma [6.41 We will estimate as before the term corresponding 
to /2, and then write for /i 

sup \Ir\ inf Mi(/i * i<^(p))(x). 



15i 



This reduction is made possible by and explains the presence of the collection Xt in (I6.24p . 
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□ 



By Lemma [6.41 and fl5.16p we know that for each interval uj := [e2^,e2^^^] there is at 
most one scale \Ir\ such that uj{p) = oj. The proof follows as in the previous case, by 
applying the Fefferman- Stein inequality and the Littlewood-Paley theory. 

Lemma 6.17. For each G R, ^ G M and I we have 

( V" 



\ 



< 



Ir I|2- 



/ 



For each i-tree (T, px) with i G {0, 1, 2, 3} \ {j} we have 

|2\ 1/2 



< 



I j^|l/2 ll'^-^-fx ll^- 



Proof. Use ([S3D and ([S3D with / := fxj"^ and 



r^^^, and note that 



{M_c(ujp.)(l)pj)Xirj, is L adapted of order 2 to Ip. and has the same frequency support 

as M_c(^^^)0p^. 

By interpolating between the previous two lemmas we get 



□ 



Corollary 6.18. Let f G X{E) and 1 < p < 2 and e > 0. For each i? G R, ^ G 
have 

\ 1/2 



we 



( 



\ 



p;Sp;(-R) 



< 



I 



< (supx/,(x)) — (inf M,{1e){x)Y/p. 



For each i-tree (T, px) with i G {0, 1, 2, 3} \ {j} we have 

|2\ 1/2 



< C sup 



|/|i/p 



N~2\ 



< 



^(sup x/(a;))^(sup inf Mi(li5)(x))^/P 



where C = A'^p'^ if i G {0,3} and j G {1,2}, and C = I if i G {1,2}. 

6.5. Proof of Theorem l5.5[ We may assume R is finite, and get bounds independent 
of R. At the expense of losing a factor of A^'^t^'^ in the bounds, it suffices to assume 

2 

that the collection R of multi-rectangles is M-separated into scales with M = A^-^. 
Since N can be taken arbitrarily large, all the factors of the form ^'^(iv) contributing 
to various bounds may and will be tolerated. 
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By scaling invariance we may assume that {Ej^l = 1. Define 

3 

n=[j{x: M,{1e^){x) > 100\Ej\} 
i=i 

and Ej^ := Ej^^ \ Q, and note that \Ejo\ > ^\Ejf^\. 

Let fj G X2{Ej), j 7^ jo and /,„ G X2{Ejg). All sizes sizcj^, are understood with 
respect to fj. We need to show that for some 6" < 6 

(6.26) \A{f,,f2,f3)\<A'"\E^nE2nEs\^\ 

where 7i := aj — |. We note that due to our restrictions, we have that < 7ji, 7j2 < | 
and + 7^2 = —C(jo < |- Thus we can find < Pi, P2 < \ sufficiently close to | such 
that aj-^ := 1 — ^ > 0, := 1 — > and aj^ := 2 — a^^ — aj.^ G (0, 1). 
We shall make the assumption that either 

(1) IrH {R\n)^^ for al0 i? G R, or 

(2) Ir = I for all i? G R, for some (fixed) dyadic I G Q with 

(6.27) 2' < 1 + dist(/,R\^^) ^ ^i+i^ ^^^^ ^^^^^^ ^ ^ 

and prove (16.261) for both case (1) and (2), with an additional multiplicative factor of 
2~^|J| in the bound, in case (2). If we can prove these special cases with the indicated 
gain, the general case follows by summation in I and /, since \Q\ < 1. We present the 
argument for case (1), and then will indicate how to modify it for case (2). 

Define Pji,Pj2 ^ (1, 2) such that Pi = h ~ p' I define = 2. Note 

that by Corollary 16.181 it easily follows that for each e > 

(6.28) maxsizej,,,(p) < |Ej-J^M^i+^ 

(6.29) maxsizej2,i(p) < lEj.y^A'^- 



(6.30) max size jo,i(p) < 1. 

i 

Fix j G {1, 2, 3}. We successively use the Pealing Lemma IB. 15[ simultaneously for each 
i G {0, 1, 2, 3}, to decompose p = [J^^'^^^^'^^^'' p^"''' such that p^"'^ consists of the union 
of a family J^l^^ of trees (T, pt), p^ ^ = Uxe^^^' satisfying 

(6.31) maxsize,-,(pi^'^) < 2^ 

(6.32) |/t| < A''^^)2-2^ 



16i 



The collection of multi-tiles is also appropriately restricted. 
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We get 

iA(/i,/2,/3)i< J2 E \iRrwtlm<i^P.)i 

where we implicitly assume that 

2^^ < maxsizej,j(p). 

By symmetry we may restrict ourselves to the case kj^ = maxj kj , for some G 
{1, 2, 3}. We can further estimate the sum above by 

(6.33) Yl E E i^«i"Vp3r/'flK/.,^.)i, 

where pt := Pt H p^'^'' fl p^"','', j,j' G {1,2,3} \ j*. Note that T gets partitioned by 

the intersection with various trees from J-'l^^^ and J^^"'*'*, where = {1, 2, 3} \ j*. 

By the final observation in the proof of the Peeling Lemma, two different such trees in 
some J^fc , / G {1,2}, will not share any multi-rectangle. It follows that we have the 

3* 

following natural partition 

T= [j Tr 

refl(T) 

Pt = U Pt, 

rGR(T) 

where (T^, px,) is a subtree of (T, px) which arises by an intersection, as described 
above. Due to the elimination of the saturations S{It',^t') in Step 2 and Step 6 of 
the algorithm in the Peeling Lemma (here T' is a generic tree in * , / G {1,2}), it 

easily follows that the trees (T^, Pt, ) can be assigned tops It, C which are pairwise 
disjoint for r G R(T). In particular, 

E i^T.i < i/t|. 

rG-R{T) 

We also note that each subtree (Tr,PTr) satisfies maxj sizej,i(pTr) < 2'^^ for each 
j G {1, 2, 3}. Using these observations and then invoking (16.21) and (I6.32p with j = j^:, 
we may estimate (16.331) by 



56 A. BENYI, C. DEMETER, A.R. NAHMOD, CM. THIELE, R.H. TORRES, AND R VILLARROYA 



where aj have been defined in the beginning of the argument. Finally, by invoking 
(16.28P - ( 16.30p we can further estimate the above by 

^ ^ 2^i(-^~"i)2^^^^~''^^2^^^"^~"^''(|£'' A^^^'^Y^"'^ (\E ■ |^2y^/32+e^l-a2 
fcl,fc2,fe3<0 

~ l-^Jll |-^j2l ^ ~ l-^Jll |-^j2l ^ 

This ends the proof in case (1), since aj^ > —6. To deal with case (2) we make the 
following modifications. Redefine fj := fjxj and (pp. := 0p^x7^' note that the new 
functions fj have the same properties as the old ones. Moreover {M_c(cjp.)(ppj)x7^ 

adapted of order — 2 to Ip^ and has the same frequency support as M_c(ujp^)(ppj- 
Then run the same argument as in case (1). As we said earlier, we expect the presence 
of |J| and 2~' in the bound for (16.261) . The presence of |J| is explained by the localized 
estimate 

||/,||^< |/|inf|E,|-'Mi(l^J(x) 

which becomes effective in the application of the Pealing Lemma in fl6.32p . The decay 
in / is due to the classical estimates 

inf Mi(l^J(x) < 2' inf Mi(l^J(x) < 2\ 



sup Xi{x) <2 \ 

which become effective in the application of Corollary 16.181 in estimating sizes. 

7. The general case: modulation invariant paraproducts 

Given any function / in BMO, we shall construct a trilinear form A satisfying the 
bounds in the conclusion of Theorem 12.71 (and thus also its assumptions) and also 

A(l, !,•) = / 
A(1,.,1) = 
A(.,1,1) = 

Such form and its symmetric counterparts under permutation of the three arguments 
will be called modulation invariant paraproducts. By subtracting three paraproducts 
we can reduce every trilinear Calderon-Zygmund form as in Theorem 12.71 to another 
one satisfying the special cancellation conditions A(l,l,.) = A(l,.,l) = A(.,l,l) = 0. 
Thus the construction of the paraproduct will finish the proof of Theorem 12.71 

Define 4>i, (f>2, (ps such that (pi = (j)2 = ^ and 03 = rio$ + T-io"^', where $ was 
introduced in (13. ip . Consider 

^{X) = [ 0i(x + (A - p3)t)M^ + iP2 - P3)t)(i)3ix) dt. 
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By using Fourier transforms it follows immediately that iJj is a. non zero Schwartz 
function. The function 

(7.1) + (A - (33)t)Mx + {[32 - f33)t)Mx) 

is easily seen to have zero integral in x for each t. Hence ip itself has mean zero. 
By Calderon's reproducing formula, we have (with some constant c) 

J = c J ^Wt^Wt- 
This well-know formula in also extends to distributions modulo polynomials pro- 
vided vanishes in a neighborhood of the origin. In particular, for / in BMO the 
formula holds in the sense 

/•I A dt 
/ = lim c y f *ipt*'^tY 

at least in the distributional sense when tested against bump functions with mean zero. 
Defining 

,PkAx) :=2-^/V(2-'x-n). 
a simple change of coordinates gives (with a new constant c) 

(7.2) /= lim c / / {f,i/jk,n)iJk,ndkdn 

with the equality holding in the sense of distributions when tested again functions with 

mean zero. 

If we define 0i,fc,„ by translation and dilation in the analogous manner, then we have 

'^k,n{x) = j 0l,jt,„(x (/3i - (3^)t)(t)2,k,n{x + (/32 - /33)t)(p3,k,n{^) 

Set 

Ck,n C (/, V'fc.n) 

Then for each k e R and each fi e «S(R) we define 



A«(/l,/2,/3) = / / / / Cfc,n 
^|fe|<«; Jm. Jr Jr 



Ylfi{x + Pit)(j)i,k,n{^ + Pit) 



.1=1 



dx dt dn dk 



3 

(7.3) - / / V\fi{x + l3it)K^{x,t)dxdt, 



Jr Jr 



where 

3 



t) = / Cfc,„ Yl <Pi,k,n{x + Pit) dn dk 

J\k\<KJR j^^i 
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Since / is in BMO, we have that |cfc,n| ^ 2^/^ (this is a particular instance of ( 17. Sp ). 
By discretizing the integral representing K^, we can write it as an average over [0, 1]^ 
of sums of the form 

3 



2-K<|/|<2'' 
/ dyadic 



with being L^-adapted to / and \ci\ < It is now an easy exercise to conclude 

that each satisfies (12 .ip and (12. 2p with 5 = 1 and with uniform constants C^^ = 
0(1). Moreover, K,^ is locally integrable since in particular \Ki^{x,t)\ < min{2'', 
These facts will easily prove that the integral in (17. 3p is indeed convergent for arbitrary 
Schwartz functions. 

Let now /i, /2, be three bump functions L?- adapted of order 2 to some interval 
J. We next prove an inequality that will justify some of our claims. Note first that for 
each X, f2{x + (/?2 — [3i)t)f^{x + (/?3 — (3i)t) is L^- adapted of order 2 (as a function of 
t) to some interval of size similar to \J\ with implicit constant 0(|J|~^/^). Similarly, 
4>2,i{x + {P2 — Pi)t)(p3j{x + {Ps — l3i)t) is L^- adapted of order 2 to some interval of size 
similar to |/| with implicit constant 0{\I\~^^'^). Moreover, this latter function also has 
mean zero with respect to t. By applying (14. 2 p to these functions and (14. ip to /i and 
we conclude that 

" 3 

Y[fi{x + (3it)(Pij{x + (3it) 



I dyadic 




dx dt 



I dyadic 



fi{x)(i)i,i{x) 



l[fi{x + (A - f3i)t)M^ + (A - f3i)t) 

i=2 

|c,||/|-i/Vr'/'(l+(max(|J|,|J|))-Mist(/,J))-^min((^y,('^' 

I dyadic 

Define now for each fi G S{M.) 

A(/i,/2,/3)= lim AM1J2J3). 



dt dx 



< 



< 



K— > + 00 



Since each Schwartz function is adapted to the unit interval centered at the origin, 
the above computations show that the above limit exists. Moreover, due to the earlier 
estimates for K^, the form A is associated with a kernel K satisfying (12. ip and (12. 2p 
with 6=1. 

Since the function 

Mx)Mx + ip2 - (3i)t)Mx + iPs - f3i)t) 

has mean zero in t for every fixed x, it is easy to verify using the kernel representation 
and the definition of Ak(., 1, 1) that Ak(., 1, 1) = 0. Then, by invoking (13. 4p and (13. 5p 
we conclude that A(., 1, 1) = 0. Likewise we see A(l, ., 1) = 0. 



-1/2 
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To see that A(l, 1, .) = /, we replace the integration variable x hj y = x + jS^t, then 
execute the integration in t to obtain for each compactly supported /s G S{R) with 
mean zero 



A«;(l, l,/3)= / / / Ck,ni'k,n{y)dndk 
Jr L-'|fc|<K Jr 



f3iy)dy. 



By (17.21) . the limit on the right hand side is J^f{y)f3{y)dy, and so we conclude that 
A(l, 1, .) is / as tempered distributions modulo constants. 

It remains to prove that A is bounded as in the conclusion of Theorem 12. 7[ Using 
that 



f\h)d{l), 



where dj-^ and ^(7) are, respectively, the normalized Lebesgue measures on 7-*- and (7), 
we note that for each /, e 5(M), A(/i, /2, /a) coincides up to some universal constant 
with 

3 



lim / / Cfc,n2" 



i=l ^ 



dn dl dk. 



Hence A is up to a universal constant an average of forms of the type 

3 

(7.4) Yl (^k,n2-''\{{fu(t>i,k,n,l), 

k,n,l&'L i=l 

where for some ko,no,lo G [0, 1] 



^i,k,n,l (•^ ) 



%,k+ko,n+no yXj& 



-27ri7i2-('=+'=o)(«+Zo)a 



The form (17. 4p is our basic model form that we wish to estimate. The function (j)i^k,n,i 
is an L^- normalized bump function adapted to the interval 

[2^'+'=o(n + no)), 2''+^''{n + no + 1)) 

By changing the bump function constants mildly, we can assume that the function is 
adapted to the dyadic interval Ik,n = p'^n, 2^{n + 1)). 

The function (f)i^k,n,i is supported in an interval uJi^k,i of length 2~'^"^^ and we may 
assume the following properties (in case i = 3 we split the generating function 03 into 
a sum of two generating functions, one with Fourier support contained in [—12, —8] 
and the other with Fourier support contained in [8, 12]. Without loss of generality we 
may replace 03 by one of the two): 

^l,k,l = ^2,k,l 

for all /c, and if 

^i,k,l ^ ^i,k',l' 7^ 

for some k, k', I, I', then for some universal constants 1 <^ C2 <^ Ci 
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and 

whenever i ^ j and at least one of i and j is equal to 3. By pigeonholing into finitely 
many summands if necessary, we also may assume that 

if n 7^ n'. We adopt the usual geometric picture that the parameter tuple {k, n, I) is 
identified with a triple p = {pi,P2,P3) of tiles 

Pi = Ip X ^p,i = Ik,n X i^i,k,l 

Compared to the theory of the bilinear Hilbert transform, the new element here is that 
two tiles in this triple are equal. This lack of separation is offset by better estimates for 
the coefficients Ck^n than in the model forms for the bilinear Hilbert transform. Since 
/ is in BMO, the coefficients Ck^n satisfy a Carleson sequence condition 

(7.5) Yl Kn\'<Cf\J\. 

This is what we need to know about the basic model operator (17. 4p we have to 
estimate. The proof runs parallel to the proof of boundedness of the bilinear Hilbert 
transform, e.g. in [28]. The only difference concerns the estimate on an individual 
tree for which both pi and p2 are overlapping and only p^ are disjoint. These trees are 
estimated by using fl7.5p as follows: 




1 a, 




\ip\ 


|l/2 



The factors in the last expression are as in [28] estimated by the tree sizes defined 
there. The rest of the proof is identical to the one in [28] and one obtains the same 
bounds as for the bilinear Hilbert transform. 

We close this section by mentioning an interesting application of the constructions 
we have performed above. For each dyadic interval / with length at least 1, let c/ be 
a coefficient selected in such a way that |c/| < |/|^^^ and such that 



(7.6) ^^PDT 5Z 



|c/p = oo. 



ICJ 
J dyadic 
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It can be easily seen by using (14.21) that 
(7.7) lim / V cii)i{y)h{y)dy 

\I\<K, 

exists for each Schwartz function /s with mean 0. Define 




\{fi{x + Pit)(t)ij{x + I3it) 



dx dt. 



A(/l,/2,/3)= 

I dyadic 

By reasoning as before, it is easy to check that the bihnear form A(., ., 1) satisfies the 
weak boundedness condition, and moreover that A(., 1, 1) = A(l, ., 1) = 0. Remark 12. II 
shows that these imply that A(., ., 1) is a bilinear form bounded on x . Also, by 
(17.71) it easily follows that the action on H^{R) atoms of A(l, 1, .) coincides with that 
of '^jCiipi- Due to (17.61) . A(l, 1, .) can not be identified with a BMO function. By 
invoking Remark 12.11 again, it follows that the bilinear form A(l, ., .) is not bounded, 
in spite of being completely represented by the same kernel as A(., ., 1). Moreover, the 
trilinear form is itself unbounded, since otherwise A(l, 1, .) would necessarily have to 
be a BMO function. Hence A(.,.,l) is a bounded bilinear Calderon-Zygmund form 
associated with a Calderon-Zygmund kernel that is not the restriction of a bounded 
trilinear form with the given parameter f3 and associated with the same K. 
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